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Quantum mechanics, information theory, and relativity theory are the basic foundations of theo- 
retical physics. The acquisition of information from a quantum system is the interface of classical 
and quantum physics. Essential tools for its description are Kraus matrices and positive operator 
valued measures (POVMs). Special relativity imposes severe restrictions on the transfer of infor- 
mation between distant systems. Quantum entropy is not a Lorentz covariant concept. Lorentz 
transformations of reduced density matrices for entangled systems may not be completely posi- 
tive maps. Quantum field theory, which is necessary for a consistent description of interactions, 
implies a fundamental trade-off between detector reliability and localizability. General relativity 
produces new, counterintuitive effects, in particular when black holes (or more generally, event 
horizons) are involved. Most of the current concepts in quantum information theory may then 
require a reassessment. 
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I. THREE INSEPARABLE THEORIES 

Quantum theory and relativity theory emerged at the 
beginning of the twentieth century to give answers to 
unexplained issues in physics: the black body spectrum, 
the structure of atoms and nuclei, the electrodynamics 
of moving bodies. Many years later, information theory 
was developed by Claude Shannon (1948) for analyzing 
the efficiency of communication methods. How do these 
seemingly disparate disciplines affect each other? In this 
review, we shall show that they are inseparably related. 

A. Relativity and information 

Common presentations of relativity theory employ fic- 
titious observers who send and receive signals. These 
"observers" should not be thought of as human be- 
ings, but rather ordinary physical emitters and detectors. 
Their role is to label and locate events in spacetime. The 
speed of transmission of these signals is bounded by c 
— the velocity of light — because information needs a 
material carrier^ and the latter must obey the laws of 
physics. Information is physical (Landauer, 1991). 

However, the mere existence of an upper bound on 
the speed of propagation of physical effects does not do 
justice to the fundamentally new concepts that were in- 
troduced by Albert Einstein (one could as well imagine 
communications limited by the speed of sound, or that 
of the postal service) . Einstein showed that simultaneity 
had no absolute meaning, and that distant events might 
have different time orderines when referred to observers 
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in relative motion. Relativistic kinematics is all about in- 
formation transfer between observers in relative motion. 

Classical information theory involves concepts such as 
the rates of emission and detection of signals, and the 
noise power spectrum. These variables have well defined 
relativistic transformation properties, independent of the 
actual physical implementation of the communication 
system. A detailed analysis by Jarett and Cover (1981) 
showed that the transmission rates for observers with rel- 
ative velocity v were altered by a factor {c + v)/{c — v), 
namely the square of the familiar Doppler factor for fre- 
quencies of periodic phenomena. We shall later derive 
the same factor from classical electromagnetic theory, see 
Eq. H36() below. Physics has a remarkably rigid theoret- 
ical structure: you cannot alter any part of it without 
having to change everything (Weinberg, 1992). 



B. Quantum mechanics and information 

Einstein's theory elicited strong opposition when it was 
proposed, but is generally accepted by now. On the other 
hand, the revolution caused by quantum theory still pro- 
duces uneasy feelings among some physicists.^ Standard 
texbooks on quantum mechanics tell you that observ- 
able quantities are represented by Hermitian operators, 
their possible values are the eigenvalues of these opera- 
tors, and that the probability of detecting eigenvalue A„, 
corresponding to eigenvector u„, is |(u„|7/')p, where tp is 
the (pure) state of the quantum system that is observed. 
With a bit more sophistication to include mixed states, 
the probability can be written in a general way {un\p\un). 

This is nice and neat, but this does not describe what 
happens in real life. Quantum phenomena do not occur 
in a Hilbert space; they occur in a laboratory. If you 
visit a real laboratory, you will never find there Hermi- 
tian operators. All you can see are emitters (lasers, ion 
guns, synchrotrons, and the like) and appropriate detec- 
tors. In the latter, the time required for the irreversible 
act of amplification (the formation of a microscopic bub- 
ble in a bubble chamber, or the initial stage of an electric 
discharge) is extremely brief, typically of the order of an 
atomic radius divided by the velocity of light. Once irre- 
versibility has set in, the rest of the amplification process 
is essentially classical. It is noteworthy that the time and 
space needed for initiating the irreversible processes are 
incomparably smaller than the macroscopic resolution of 
the detecting equipment.^ 



^ The theory of relativity did not cause as much misunderstanding 
and controversy as quantum theory, because people v^ere care- 
ful to avoid using the same nomenclature as in nonrelativistic 
physics. For example, elementary textbooks on relativity the- 
ory distinguish "rest mass" from "relativistic mass" (hard core 
relativists call them simply "mass" and "energy"). 

^ The "irreversible act of amplification" is part of the quantum 
folklore, but it is not essential to physics. Amplification is solely 



The experimenter controls the emission process and 
observes detection events. The theorist's problem is to 
predict the probability of response of this or that de- 
tector, for a given emission procedure. It often happens 
that the preparation is unknown to the experimenter, and 
then the theory can be used for discriminating between 
different preparation hypotheses, once the detection out- 
comes are known. 

Quantum mechanics tells us that whatever comes from 
the emitter is represented by a state p (a positive oper- 
ator,'^ usually normalized to unit trace). Detectors are 
represented by positive operators i?^, where fi is an arbi- 
trary label which identifies the detector. The probability 
that detector ^ be excited is ir{pE^). A complete set 
of E^, including the possibility of no detection, sums up 
to the unit matrix and is called a positive operator val- 
ued measure (POVM). The various do not in general 
commute, and therefore a detection event does not cor- 
respond to what is commonly called the "measurement 
of an observable." Still, the activation of a particular de- 
tector is a macroscopic, objective phenomenon. There is 
no uncertainty as to which detector actually clicked. 

Many physicists, perhaps a majority, have an intuitive 
realistic worldview and consider a quantum state as a 
physical entity. Its value may not be known, but in prin- 
ciple the quantum state of a physical system would be 
well defined. However, there is no experimental evidence 
whatsoever to support this naive belief. On the contrary, 
if this view is taken seriously, it may lead to bizarre con- 
sequences, called "quantum paradoxes." These so-called 
paradoxes originate solely from an incorrect interpreta- 
tion of quantum theory. The latter is thoroughly prag- 
matic and, when correctly used, never yields two contra- 
dictory answers to a well posed question. It is only the 
misuse of quantum concepts, guided by a pseudorealistic 
philosophy, that leads to paradoxical results. 

In this review we shall adhere to the view that p is 
only a mathematical expression which encodes informa- 
tion about the potential results of our experimental in- 
terventions. The latter are commonly called "measure- 
ments" — an unfortunate terminology, which gives the 
impression that there exists in the real world some un- 
known property that we are measuring. Even the very 
existence of particles depends on the context of our ex- 
periments. In a classic article, Mott (1929) wrote "Until 
the final interpretation is made, no mention should be 
made of the a-ray being a particle at all." Drell (1978) 
provocatively asked "When is a particle?" In particular, 
observers whose world lines are accelerated record differ- 
ent numbers of particles, as will be explained in Sec. V.D 
(Unruh, 1976; Wald, 1994). 



needed to facilitate the work of the experimenter. 
^ Positive operators are those having the property that (i/'|p|V') 1^ 
for any state These operators are always Hermitian. 
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C. Relativity and quantum theory 

The theory of relativity deals with the geometric struc- 
ture of a four-dimensional spacctime. Quantum mechan- 
ics describes properties of matter. Combining these two 
theoretical edifices is a difficult proposition. For exam- 
ple, there is no way of defining a relativistic proper time 
for a quantum system which is spread all over space. A 
proper time can in principle be defined for a massive 
apparatus ( "observer" ) whose Compton wavelength is so 
small that its center of mass has classical coordinates and 
follows a continuous world-line. However, when there is 
more than one apparatus, there is no role for the private 
proper times that might be attached to the observers' 
world-lines. Therefore a physical situation involving sev- 
eral observers in relative motion cannot be described by 
a wave function with a relativistic transformation law 
(Aharonov and Albert, 1981; Peres, 1995, and references 
therein). This should not be surprising because a wave 
function is not a physical object. It is only a tool for com- 
puting the probabilities of objective macroscopic events. 

Einstein's principle of relativity asserts that there are 
no privileged inertial frames. This does not imply the 
necessity or even the possibility of using manifestly sym- 
metric four-dimensional notations. This is not a pecu- 
liarity of relativistic quantum mechanics. Likewise in 
classical canonical theories, time has a special role in the 
equations of motion. 

The relativity principle is extraordinarily restrictive. 
For example, in ordinary classical mechanics with a fi- 
nite number of degrees of freedom, the requirement that 
the canonical coordinates q have the meaning of posi- 
tions, so that particle trajectories q(t) transform like 
four-dimensional world lines, implies that these lines con- 
sist of straight segments. Long range interactions are for- 
bidden; there can be only contact interactions between 
point particles (Currie, Jordan, and Sudarshan, 1963; 
Leutwyler, 1965). Nontrivial relativistic dynamics re- 
quires an infinite number of degrees of freedom which 
are labelled by the spacetime coordinates (this is called 
a field theory). 

Combining relativity and quantum theory is not only 
a difficult technical question on how to formulate dynam- 
ical laws. The ontologies of these theories are radically 
different. Classical theory asserts that fields, velocities, 
etc., transform in a definite way and that the equations 
of motion of particles and fields behave covariantly. For 
example if the expression for the Lorentz force is written 
fn = FnvU" in one frame, the same expression is valid in 
any other frame. These symbols (/^, etc.) have objective 
values. They represent entities that really exist, accord- 
ing to the theory. On the other hand, wave functions 
are not defined in spacetime, but in a multidimensional 
Hilbert space. They do not transform covariantly when 
there are interventions by external agents, as will be seen 
in Sec. IIL Only the classical parameters attached to 
each intervention transform covariantly. Yet, in spite of 
the non-covariance of p, the final results of the calcula- 



tions (the probabilities of specified sets of events) must 
be Lorentz invariant. 

As a simple example, consider our two observers, con- 
ventionally called Alice and Bob,^ holding a pair of spin- 
^ particles in a singlet state. Alice measures cr^ and finds 
+1, say. This tells her what the state of Bob's particle 
is, namely the probabilities that Bob would obtain ±1 if 
he measures (or has measured, or will measure) <r along 
any direction he chooses. This is purely counterfactual 
information: nothing changes at Bob's location until he 
performs the experiment himself, or receives a message 
from Alice telling him the result that she found. In par- 
ticular, no experiment performed by Bob can tell him 
whether Alice has measured (or will measure) her half of 
the singlet. 

A seemingly paradoxical way of presenting these re- 
sults is to ask the following naive question: suppose that 
Alice finds that az = 1 while Bob does nothing. When 
does the state of Bob's particle, far away, become the one 
for wliic;li = —I with certainty? Though this question 
is meaningless, it may be given a definite answer: Bob's 
particle state changes instantaneously. In which Lorentz 
frame is this instantaneous? In any frame! Whatever 
frame is chosen for defining simultaneity, the experimen- 
tally observable result is the same, as can be shown in a 
formal way (Peres, 2000b). Einstein himself was puzzled 
by what seemed to be the instantaneous transmission of 
quantum information. In his autobiography, he wrote the 
words "telepathically" and "spook" (Einstein, 1949). 

Examples like the above one, taken from relativistic 
quantum mechanics, manifestly have an informational 
nature. We cannot separate the three disciplines: rel- 
ativity, quantum mechanics, and information theory. 



D. The meaning of probability 

In this review, we shall often invoke the notion of prob- 
ability. Quantum mechanics is fundamentally statistical 
(Ballentine, 1970). In the laboratory, any experiment 
has to be repeated many times in order to infer a law; 
in a theoretical discussion, we may imagine an infinite 
number of replicas of our gedankenexperiment, so as to 
have a genuine statistical ensemble. Yet, the validity of 
the statistical nature of quantum theory is not restricted 
to situations where there are a large number of similar 
systems. Statistical predictions do apply to single events. 
When we are told that the probability of precipitation 
tomorrow is 35%, there is only one tomorrow. This tells 
us that it may be advisable to carry an umbrella. Prob- 
ability theory is simply the quantitative formulation of 
how to make rational decisions in the face of uncertainty 



* Alice and Bob joined the quantum information community after 
a distinguished service in classical cryptography. For example, 
they appeared in the historic RSA paper (Rivest, Shamir, and 
Adleman, 1978). 
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(Fuchs and Peres, 2000). A lucid analysis of how prob- 
abilistic concepts are incorporated into physical theories 
is given by Emch and Liu (2002). 



E. The role of topology 

Physicists often tend to ignore the topological struc- 
ture of the concepts that they use, or turn to it only as a 
last resort. Actually, there is a "bewildering" multitude 
of topologies (Reed and Simon, 1980). Many of them 
have a direct physical meaning (Emch 1972; Haag, 1996; 
Araki, 1999). In particular, since measurements can ac- 
tually be performed only with a finite accuracy, a finite 
number of outcomes, and a finite number of times, only 
bounded ranges of values are ever registered. Suppose 
that we measure N times the value q of an observable Q, 
and a value qj is obtained Uj times. A relative frequency 
Wj = rij/N is either used to extract a probability esti- 
mate, or it is taken at face value and interpreted as the 
estimate. Thus the information about a state p can be 
formulated as (Araki, 1999; Peres and Terno, 1998) 

\P'^{U)-Wj\<ej, (1) 

for some positive ej . These inequalities induce a natural 
topology on the space of states, which is called a "physi- 
cal topology" (Emch, 1972; Araki, 1999). More precisely, 
they define a weak-* topology on the observables and a 
weak topology on the states. This is a trace-norm topol- 
ogy^ (Reed and Simon, 1980). These structures are nat- 
urally accommodated in the algebraic approach to quan- 
tum theory. That approach consists in the characteriza- 
tion of the theory by a net of algebras of local observ- 
ables, and is especially suited for the analysis of infinite 
systems in quantum statistical mechanics and quantum 
field theory. We will use results based on algebraic field 
theory in Sec. V and VI. ^ 



F. The essence of quantum information 

In an early review of quantum information theory. In- 
garden (1976) distinguished two fundamental aspects: 



Since probabilities in quantum mechanics are given by the ex- 
pression tr {pEij, ) , and physically acceptable states are trace class 
positive operators, the trace norm topology is the concrete real- 
ization of the physical topology. 

References whose primary interest is field theory include Bogol- 
ubov et al. (1990), Haag (1996) and Araki (1999). On the 
other hand, Davies (1976), Bratteli and Robinson (1987), and 
Ingraden, KossaJsowski and Ohaya (1997), consider mainly ap- 
plications to open quantum systems, statistical mechanics and 
thcrrnodyiiairiics. Emch (1972) is concerned with both. Emch 
(1972), Bratelli and Robinson (1987), and Baumgartel and Wol- 
Icnbcrg (1992) give a rigorous, and yet readable exposition of the 
subject. 



"Information theory, as it is understood in 
this paper and as it usually understood by 
mathematicians and engineers following the 
pioneer paper of Shannon, is not only a the- 
ory of the entropy concept itself (in this as- 
pect information theory is most interesting 
for physicists), but also a theory of transmis- 
sion and coding of information, i.e., a theory 
of information sources and channels." 

In other words: the goals of quantum information the- 
ory are the intersection of those of quantum mechan- 
ics and information theory, while its tools are the union 
of those of these two theories. Actually, the tools be- 
longing to quantum theory were developed under the in- 
fluence of nascent quantum information, "when it was 
necessary to consider communication problems for the 
needs of quantum of quantum electronic and optics" (In- 
garden, 1976). Work of Sudarshan et al. (1961), and 
later those of Davies, Kossakowski, Kraus, Lindblad, and 
Lewis established the formalism of quantum mechanics 
of open systems, expressed by POVMs and completely 
positive maps, while works of Helstrom, Holevo, Lebe- 
dev, and Levitin, produced important results in what 
became quantum information theory.'' We shall discuss 
these subjects in Sec. II of this review. 

Some trends in modern quantum information theory 
may be traced to security problems in quantum com- 
munication. A very early contribution was Wiesner's 
seminal paper Conjugate Coding, which was submitted 
circa 1970 to IEEE Transactions on Information Theory, 
and promptly rejected because it was written in a jar- 
gon incomprehensible to computer scientists (this actu- 
ally was a paper about physics, but it had been submitted 
to a computer science journal). Wiesner's article was fi- 
nally published (Wiesncr, 1983) in the newsletter of ACM 
SIGACT (Association for Computing Machinery, Special 
Interest Group in Algorithms and Computation Theory) . 
That article tacitly assumed that exact duplication of an 
unknown quantum state was impossible, well before the 
no-cloning theorem (Wootters and Zurek, 1982; Dieks, 
1982) became common knowledge. Another early arti- 
cle, Unforgeable Subway Tokens (Bennett et al., 1983), 
also tacitly assumed the same. 

The standard method for quantum cryptography was 
invented by Bennett and Brassard (1984), using two 
mutually unbiased bases, namely two bases such that 
{um\Vfj.) = 1/Vd, where d is the number of Hilbert 
space dimensions. Security may be improved by us- 
ing three bases (Brufi, 1998; Bechmann-Pasquinucci and 
Gisin, 1999), and even more by going to higher dimen- 
sions (Bechmann-Pasquinucci and Peres, 2000; Brui3 and 
Macchiavello 2002). Gisin, Ribordy, Tittel and Zbinden 



^ The books of Davies (1976), Holevo (1982), and Ingarden, Kos- 
sakowski and Ohaya (1997), contain historical surveys and ex- 
haustive lists of references. 
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(2002) recently reviewed theoretical and experimental re- 
sults in quantum cryptography. 

A spectacular discovery was that of quantum telepor- 
tation (Bennett et al.., 1993), which effectively turned 
quantum entanglement into a communication resource. 
Soon afterwards, it also became a computational resource 
(Shor, 1994) and since then it continues to attract consid- 
erable attention. Various aspects of entanglement theory 
are reviewed in special issues of Quantum Information 
and Computation (2001) 1 (1) and Journal of Mathe- 
matical Physics (2002) 43 (9). Experimental results were 
reviewed by Zcilingcr (1999). 

Quantum binary channels were introduced by Schu- 
macher (1995), who also generalized Shannon's coding 
theorems to the quantum domain, and coined the word 
"qubit" (quantum bit) for elementary carriers of quan- 
tum information. Quantum channels are discussed by 
Holevo (1999), Amosov, Holevo, and Werner (2000), 
King and Ruskai (2001), and in the special issue of Jour- 
nal of Mathematical Physics (2002) 43 (9). An extensive 
review of the mathematical aspects of quantum informa- 
tion theory was given by Keyl (2002). 

Our review deals with many interrelated issues. 
Causality constraints on POVMs are discussed in 
Sec. II. E. Relativistic extensions of the formalism appear 
in Sec. Ill and VI. A. In Sec. IV we discuss how relativistic 
considerations modify basic notions of quantum informa- 
tion theory: qubits, entanglement, and quantum chan- 
nels. In Sec. V we investigate the implications of quan- 
tum field theory on the construction of POVMs and the 
detection of entanglement. Section VI.A deals with rel- 
ativistic extensions of quantum information theory, and 
in Sec. VLB we discuss its applications to the black hole 
physics. 

II. THE ACQUISITION OF INFORMATION 
A. The ambivalent quantum observer 

Quantum mechanics is used by theorists in two differ- 
ent ways: it is a tool for computing accurate relation- 
ships between physical constants, such as energy levels, 
cross sections, transition rates, etc. These calculations 
are technically difficult, but they are not controversial. 
Besides this, quantum mechanics also provides statisti- 
cal predictions for results of measurements performed on 
physical systems that have been prepared in a specified 
way. The quantum measuring process is the interface of 
classical and quantum phenomena. The preparation and 
measurement are performed by macroscopic devices, and 
these are described in classical terms. The necessity of 
using a classical terminology was emphasized by Niels 
Bohr (1927) since the very early days of quantum me- 
chanics. Bohr's insistence on a classical description was 
very strict. He wrote (1949): 

". . . by the word 'experiment' we refer to a 
situation where we can tell others what we 



have done and what we have learned and that, 
therefore, the account of the experimental ar- 
rangement and of the results of the observa- 
tions must be expressed in unambiguous lan- 
guage, with suitable application of the termi- 
nology of classical physics." 

Note the words "we can tell." Bohr was concerned 
with information, in the broadest sense of this term. He 
never said that there were classical systems or quantum 
systems. There were physical systems, for which it was 
appropriate to use the classical language or the quantum 
language. There is no guarantee that either language 
gives a perfect description, but in a well designed exper- 
iment it should be at least a good approximation. 

Bohr's approach divides the physical world into "en- 
dosystems" (Finkelstein, 1988) that are described by 
quantum dynamics, and "exosystems" (such as measur- 
ing apparatuses) that are not described by the dynam- 
ical formalism of the endosystem under consideration. 
A physical system is called "open" when parts of the 
universe are excluded from its description. In different 
Lorentz frames used by observers in relative motion, dif- 
ferent parts of the universe may be excluded. The sys- 
tems considered by these observers are then essentially 
different, and no Lorentz transformation exists that can 
relate them (Peres and Terno, 2002). 

It is noteworthy that Bohr never described the measur- 
ing process as a dynamical interaction between an exo- 
physical apparatus and the system under observation. He 
was of course fully aware that measuring apparatuses are 
made of the same kind of matter as everything else, and 
they obey the same physical laws. It is therefore tempt- 
ing to use quantum theory in order to investigate their 
behavior during a measurement. However, if this is done, 
the quantized apparatus loses its status of a measuring 
instrument. It becomes a mere intermediate system in 
the measuring process, and there must still be a final in- 
strument that has a purely classical description (Bohr, 
1939). 

Measurement was understood by Bohr as a primitive 
notion. He could thereby elude questions which caused 
considerable controversy among other authors. A quan- 
tum dynamical description of the measuring process was 
first attempted by John von Neumann, in his treatise on 
the mathematical foundations of quantum theory (1932). 
In the last section of that book, as in an afterthought, 
von Neumann represented the apparatus by a single de- 
gree of freedom, whose value was correlated to that of the 
dynamical variable being measured. Such an apparatus 
is not, in general, left in a definite pure state, and it does 
not admit a classical description. Therefore, von Neu- 
mann introduced a second apparatus which observes the 
first one, and possibly a third apparatus, and so on, until 
there is a final measurement, which is not described by 
quantum dynamics and has a definite result (for which 
quantum mechanics can only give statistical predictions). 
The essential point that was suggested, but not proved by 
von Neumann, is that the introduction of this sequence 
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of apparatuses is irrelevant: the final result is the same, 
irrespective of the location of the "cut" between classical 
and quantum physics.^ 

These different approaches of Bohr and von Neumann 
were reconciled by Hay and Peres (1998), who introduced 
a dual description for the measuring apparatus. It obeys 
quantum mechanics while it interacts with the system 
under observation, and then it is "dequantized" and is 
described by a classical Liouville density which provides 
the probability distribution for the results of the mea- 
surement. Alternatively, the apparatus may always be 
treated by quantum mechanics, and be measured by a 
second apparatus which has such a dual description. The 
question raised by Hay and Peres is whether these two 
different methods of calculation give the same result, or 
at least asymptotically agree under suitable conditions. 
They showed that a sufficient condition for agreement 
between the two methods is that the dynamical variable 
used as a "pointer" by the first apparatus be represented 
by a "quasi-classical" operator of the Weyl-Wigner type 
(Hillery et al, 1984). 

To avoid any misunderstanding, we emphasize that the 
classical description of a pointer is not by moans of a 
point in phase space, but by a Liouville density. Quan- 
tum theory makes only statistical predictions, and any 
scmiclassical treatment that simulates it must also be 
statistical. 



B. The measuring process 

Dirac (1947) wrote "a measurement always causes the 
system to jump into an eigenstate of the dynamical vari- 
able being measured." Here, we must be careful: a quan- 
tum jump (also called collapse) is something that hap- 
pens in our description of the system, not to the system 
itself. Likewise, the time dependence of the wave func- 
tion docs not represent the evolution of a physical sys- 
tem. It only gives the evolution of probabilities for the 
outcomes of potential experiments on that system (Fuchs 
and Peres, 2000). 

Let us examine more closely the measuring process. 
First, we must refine the notion of measurement and 
extend it to a more general one: an intervention. An 
intervention is described by a set of parameters which 
include the location of the intervention in spacctime, re- 
ferred to an arbitrary coordinate system. We also have to 
specify the speed and orientation of the apparatus in the 
coordinate system that wc arc; using, and various other 
input parameters that control the apparatus, such as the 
strength of a magnetic field, or that of an rf pulse used 
in the experiment. The input parameters are determined 



At this point, von Neumann also speculated that the final step 
involves the consciousness of the observer — a bizarre statement 
in a mathematically rigorous monograph (von Neumann, 1955). 



by classical information received from past interventions, 
or they may be chosen arbitrarily by the observer who 
prepares that intervention, or by a local random device 
acting in lieu of the observer. 

An intervention has two consequences. One is the ac- 
quisition of information by means of an apparatus that 
produces a record. This is the "measurement." Its out- 
come, which is in general unpredictable, is the output 
of the intervention. The other consequence is a change 
of the environment in which the quantum system will 
evolve after completion of the intervention. For example 
the intervening apparatus may generate a new Hamilto- 
nian which depends on the recorded result. In particular, 
classical signals may be emitted for controlling the execu- 
tion of further interventions. These signals are of course 
limited to the velocity of light. 

The experimental protocols that we consider all start 
in the same way, with the same initial state po, and the 
first intervention is the same. However, later stages of the 
experiment may involve different types of interventions, 
possibly with different spacetime locations, depending on 
the outcomes of the preceding events. Yet, assuming that 
each intervention has only a finite number of outcomes, 
there is for the entire experiment only a finite number 
of possible records. (Here, the word "record" means the 
complete list of outcomes that occurred during the exper- 
iment. We do not want to use the word "history" which 
has acquired a different meaning in the writings of some 
quantum theorists.) 

Each one of these records has a definite probability in 
the statistical ensemble. In the laboratory, experimenters 
can observe its relative frequency among all the records 
that were obtained; when the number of records tends 
to infinity, this relative frequency is expected to tend to 
the true probability. The aim of theory is to predict the 
probability of each record, given the inputs of the vari- 
ous interventions (both the inputs that are actually con- 
trolled by the local experimenter and those determined 
by the outputs of earlier interventions). Each record is 
objective: everyone agrees on what happened (e.g., which 
detectors clicked). Therefore, everyone agrees on what 
the various relative frequencies are, and the theoretical 
probabilities arc also the same for everyone. 

Interventions are localized in spacetime, but quantum 
systems are pervasive. In each experiment, irrespective 
of its history, there is only one quantum system, which 
may consist of several particles or other subsystems, cre- 
ated or annihilated at the various interventions. Note 
that all these properties still hold if the measurement 
outcome is the absence of a detector click. It does not 
matter whether this is due to an imperfection of the de- 
tector or to a probability < 1 that a perfect detector 
would be excited. The state of the quantum system does 
not remain unchanged. It has to change to respect uni- 
tarity. The mere presence of a detector that could have 
been excited implies that there has been an interaction 
between that detector and the quantum system. Even if 
the detector has a finite probability of remaining in its 
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initial state, the quantum system correlated to the latter 
acquires a different state (Dicke, 1981). The absence of 
a click, when there could have been one, is also an event. 

Interventions, as defined above, start by an interaction 
with a measuring apparatus, called "premeasurement" 
(Peres, 1980). The quantum system and the apparatus 
are initially in a state "^gCs \s)i^\A), and become entan- 
gled into a single composite system C: 

^c,|s)® 1^) ->^c,C/,a|A), (2) 

where {|A)} is a complete basis for the states of C. It 
is the choice of the unitary matrix Us\ that determines 
which property of the system under study is correlated to 
the apparatus, and therefore is measured. When writing 
the above equation, we tacitly assumed that the quantum 
system and the measuring apparatus were initially in a 
pure state. Since a mixed state is a convex combination 
of pure states, no new feature can result from taking 
mixed states (which would admittedly be more realistic) . 
Relativistic restrictions on the allowed forms of Us\ will 
be discussed in Sec. III. 

The measuring process involves not only the physical 
system under study and a measuring apparatus (which 
together form the composite system C) but also their "en- 
vironment" which includes unspecified degrees of free- 
dom of the apparatus and the rest of the world. These 
unknown degrees of freedom interact with the relevant 
ones, but they are not under the control of the experi- 
menter and cannot be explicitly described. Our partial 
ignorance is not a sign of weakness. It is fundamental. If 
everything were known, acquisition of information would 
be a meaningless concept. 

A complete description of C involves both macroscopic 
and microscopic variables. The difference between them 
is that the environment can be considered as adequately 
isolated from the microscopic degrees of freedom for the 
duration of the experiment and is not influenced by them, 
while the environment is not isolated from the macro- 
scopic degrees of freedom. For example, if there is a 
macroscopic pointer, air molecules bounce from it in a 
way that depends on the position of that pointer. Even if 
we can neglect the Brownian motion of a massive pointer, 
its influence on the environment leads to the phenomenon 
of decoherence, which is inherent to the measuring pro- 
cess. 

An essential property of the composite system C, which 
is necessary to produce a meaningful measurement, is 
that its states form a flnite number of orthogonal sub- 
spaces which are distinguishable by the observer. Each 
macroscopically distinguishable subspace corresponds to 
one of the outcomes of the intervention and defines a 
POVM element Ef^, given explicitly by Eq. ((SJ below. 
Let us therefore introduce a complete basis for C, namely 
^)}, where labels a macroscopic subspace, and ^ la- 
bels microscopic states in that subspace. 



C. Decoherence 

Up to now, the quantum evolution is well defined and 
it is in principle reversible. It would remain so if the envi- 
ronment could be perfectly isolated from the macroscopic 
degrees of freedom of the apparatus. This demand is of 
course self-contradictory, since we have to read the re- 
sult of the measurement if we wish to make any use of it. 
A detailed analysis of the interaction with the environ- 
ment, together with plausible hypotheses (Peres, 2000a), 
shows that states of the environment that are correlated 
to subspaces of C with different labels fi can be treated 
as if they were orthogonal. This is an excellent approx- 
imation (physics is not an exact science, it is a science 
of approximations). The resulting theoretical predictions 
will almost always be correct, and if any rare small de- 
viation from them is ever observed, it will be considered 
as a statistical quirk, or an experimental error. 

The density matrix of the quantum system thus is ef- 
fectively block-diagonal and all our statistical predictions 
are identical to those obtained for an ordinary mixture 
of (unnormalized) pure states 

|V'^)=^c,f/,^,.|M,0, (3) 

where the statistical weight of each state is the square 
of its norm. This process is called decoherence. Each 
subspace /i is stable under decoherence — it is their rel- 
ative phase that decoheres. From this moment on, the 
macroscopic degrees of freedom of C have entered into the 
classical domain. We can safely observe them and "lay 
on them our grubby hands" (Caves, 1982). In particu- 
lar, they can be used to trigger amplification mechanisms 
(the so-called detector clicks) for the convenience of the 
experimenter. 

Some authors claim that decoherence may provide a 
solution of the "measurement problem," with the partic- 
ular meaning that they attribute to that problem (Zurek, 
1991). Others dispute this point of view in their com- 
ments on the above article (Zurek, 1993). A reassessment 
of this issue and many important technical details were 
recently pubhshed by Zurek (2002, 2003). Yet, decoher- 
ence has an essential role, as explained above. It is es- 
sential to distinguish decoherence, which results from the 
disturbance of the environment by the apparatus (and is 
a quantum effect), from noise, which would result from 
the disturbance of the system or the apparatus by the en- 
vironment and would cause errors. Noise is a mundane 
classical phenomenon, which we ignore in this review.^ 



The so-called "quantum noise" which is discussed in Sec. IV. C 
has a different nature. 
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D. Kraus matrices and POVMs 

The final step of the intervention is to discard part of 
the composite system C. The discarded part may de- 
pend on the outcome /i. We therefore introduce in the 
subspace two sets of basis vectors a) and m) for 
the new system and the part that is discarded, respec- 
tively. We thus obtain for the new system a reduced 
density matrix 

(Pm)-^^ = XI ^i^t^rn)as Pst (A;^)rt, (4) 
m s,t 

where pst = Cgcl is the initial state, and the notation 

was introduced for later convenience. Recall that the 
indices s and a refer to the original system under study 
and to the final one, respectively. Omitting these indices, 
Eq. Q takes the familiar form 

P'tJ-^^ P ^Im> (6) 

m 

where /l( is a label that indicates which detector was in- 
volved and the label m refers to any subsystem that was 
discarded at the conclusion of the interaction. Clearly, 
the "quantum jump" p ^ is not a dynamical pro- 
cess that occurs in the quantum system by itself. It re- 
sults from the introduction of an apparatus, followed by 
its deletion or that of another subsystem. A jump in 
the quantum state occurs even when there is no detector 
click or other macroscopic amplification, because we im- 
pose abrupt changes in our way of delimiting the object 
that we consider as the quantum system under study. 

The initial p is usually assumed to be normalized to 
unit trace, and the trace of p'^ is the probability of oc- 
currence of outcome p. Note that each symbol A^m in 
the above equation represents a matrix (not a matrix 
element). Explicitly, the Kraus operators (Kraus, 
1983) are given by Eq. jSJl, where Usuam is the matrix 
element for the unitary interaction between the system 
under study and the apparatus, including any auxiliary 
systems that are subsequently discarded (Peres, 2000a). 

Equation © is sometimes written p'^ = iSp, where S 
is a linear superoperator which acts on density matrices 
like ordinary operators act on pure states. Note however 
that these superoperators have a very special structure, 
explicitly given by Eq. I^. 

It is noteworthy that Eq. © is the most general com- 
pletely positive hnear map (Stinespring, 1955; Davies, 
1976; Kraus, 1983). This is a crucial property: a linear 
map T{p) is called positive if it transforms any positive 
matrix p (namely, one without negative eigenvalues) into 
another positive matrix. It is called completely positive if 
(T ® 1) acting on a bipartite p produces a valid bipartite 
p. For instance, complex conjugation of p (whose mean- 
ing is time reversal) is a positive map. However, it is not 



completely positive. If we have two systems, it is physi- 
cally meaningless to reverse the direction of time for only 
one of them. One can write a formal expression for this 
impossible process, but the resulting "density matrix" 
is unphysical because it may have negative eigenvalues 
(Peres, 1996). The case for consideration of completely 
positive maps was made by Kraus (1971), Davies (1976) 
and Lindblad (1976), and since than they are part of the 
toolbox of quantum information. In Sec. lIV.El we discuss 
apparent exceptions to this approach. 

It follows from Eq. lO that the probability of occur- 
rence of outcome p is 

PM-IItr(A^„p4„)-tr(pS^). (7) 

m 

The positive operators 

£^M=E4m^Mm- (8) 
rri 

whose dimensions are the same as those of the initial 
p, satisfy = \ owing to the unitarity of Us^am ■ 

Therefore they are the elements of a POVM. Conversely, 
given (a positive matrix of order k) it is always pos- 
sible to split it in infinitely many ways as in the above 
equation. 

In the special case where the POVM elements E^ com- 
mute, they are orthogonal projection operators, and the 
POVM becomes a projection valued measure (PVM). 
The corresponding intervention is sometimes called a 
von Neumann measurement. Rigorous treatment of the 
POVM formalism can be found in the books of Davies 
(1976), Holevo (1982), and Kraus (1983). 

E. The no-communication theorem 

We now derive a sufficient condition that no instan- 
taneous information transfer can result from a distant 
intervention. We shall show that the condition is 

[A^m,S,„]=0, (9) 

where A^^m and are Kraus matrices for the observa- 
tion of outcomes p by Alice and i' by Bob. Indeed, the 
probability that Bob gets a result i', irrespective of what 
Alice found, is 

P'^=J2^''{J2 ^/^™ p 4™ ^In) ■ (10) 

We now make use of Eq. @ to exchange the positions 
of Af^m and B^n, and likewise those of Ajj^^ and -Bj„, 
and then we move A^m from the first position to the last 
one in the product of operators in the traced parenthesis. 
We thereby obtain expressions as in Eq. ©. These are 
elements of a POVM that satisfy — 1. Therefore 

Eq. l(Tn|l reduces to 

p, = tr(j2B.npBl^y (11) 

n 
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whence all expressions involving Alice's operators 
have totally disappeared. The statistics of Bob's result 
are not affected at all by what Alice may simultaneously 
do somewhere else. This proves that Eq. © indeed is 
a sufficient condition for no instantaneous information 
transfer. ^'^ 

Note that any classical communication between distant 
observers can be considered as a kind of long range inter- 
action. Indeed, it is always possible to treat their appa- 
ratuses as quantum systems (von Neumann, 1932; Bohr, 
1939) and then any signals that propagate between these 
apparatuses are a manifestation of their mutual interac- 
tion. The propagation of signals is of course bounded by 
the velocity of light. As a result, there exists a partial 
time ordering of the various interventions in an exper- 
iment, which defines the notions earlier and later (we 
assume that there are no closed causal loops) . The input 
parameters of an intervention are deterministic (or pos- 
sibly stochastic) functions of the parameters of earlier in- 
terventions, but not of the stochastic outcomes resulting 
from later or mutually spacelike interventions (Blanchard 
and Jadczik, 1996 and 1998; Percival, 1998). 

Even these apparently simple notions lead to non- 
trivial results. Consider a separable bipartite superop- 
erator T, 

Tip) = MkpMl , Mk^Ak(^Bk, (12) 

k 

where the operators Ak represent operations of Alice, 
and Bk those of Bob. It was shown by Bennett et al. 
(1999) that not all such superoperators can be imple- 
mented by local transformations and classical communi- 
cation (LOCC). For more on this subject, see Walgate 
and Hardy (2002). 

A classification of bipartite state transformations was 
introduced by Beckman et al. (2001). It consists of 
the following categories. There are localizahle opera- 
tions that can be implemented locally by Alice and Bob, 
possibly with the help of prearranged entangled auxil- 
iary systems (ancillas), but without classical comunica- 
tion. Ideally, local operations are instantaneous, and the 
whole process can be viewed as performed at a definite 
time. For semilocalizable operations, the requirement of 
no communication is relaxed and one-way classical com- 
munication is possible. It is obvious that any tensor- 
product operation Ta ^ 7b is localizable. The converse 
is not always true, for example in Bell measurements 
(Braunstein, Mann, and Revzen, 1992) which distinguish 
between the four standard bipartite entangled states, 

|vl'±>:=4(|0)|l)±|l)|0)), (13) 



An algebraic approach to statistical independence and to related 
topics is discussed by Florig and Summers (1997), while Neu- 
mann and Werner (1983) specifically address the issue of causal- 
ity between preparation and registration processes. 



|ci>±):=-^(|0)|0)±|l)|l)). (14) 

Other classes of bipartite operations are defined as fol- 
lows: Bob performs a local operation Tb just before the 
global operation T. If no local operation of Alice can 
reveal any information about Tb, i.e., Bob cannot signal 
to Alice, then the operation T is semicausal. If the oper- 
ation is semicausal in both directions, it is called causal. 

In many cases it is easier to prove causality than local- 
izability. To check the causality of an operation T whose 
outcomes are the states = Tfj^{p)/p^ with probabilities 
= tr T)( (p) , it is enough to consider the corresponding 
superoperator 

T'ip):^Y.T,ip). (15) 

Indeed, assume that Bob's action prior to the global op- 
eration leads to one of the two different states pi and p2 . 
Then the states T'(pi) and T'{p2) are distinguishable 
if and only if some of the pairs of states T^{pi)/p^i and 
Tfj,{p2)/P(i2 are distinguishable. Such probabilistic distin- 
guishability shows that the operation T is not semicausal. 
These definitions of causal and localizable operators ap- 
pear equivalent. It is easily proved that localizable oper- 
ators are causal. It was shown that semicausal operators 
are always semilocalizable (Eggeling, Schlingemann, and 
Werner, 2002). However, there are causal operations that 
are not localizable (Beckman et ai, 2001). 

It is curious that while a complete Bell measurement 
is causal, the two-outcome incomplete Bell measurement 
is not (Sorkin, 1993). Indeed, consider a two-outcome 
PVM 

£;i = !$+)($+!, ^2 = 1-^^1, (16) 

where 1$+) = (|00) + |ll))/\/2 (and the Kraus matri- 
ces are the projectors themselves) . If the initial state 
is |01)ab, then the outcome that is associated with E2 
always occurs and Alice's reduced density matrix after 
the measurement is pA = |0)(0|. On the other hand, 
if before the joint measurement Bob performs a unitary 
operation that transforms the state into |00)ab, then the 
two outcomes are equiprobable, the resulting states after 
the measurement are maximally entangled, and Alice's 
reduced density matrix is pA = 5 1- It can be shown that 
two input states |00)ab and |01)ab after this incomplete 
Bell measurement are distinguished by Alice with a prob- 
ability of 0.75. 

Here is another example of a semicausal and semilocal- 
izable measurement which can be executed with one-way 
classical communication from Alice to Bob. Consider a 
PVM measurement, whose complete orthogonal projec- 
tors are 

|0)®|0), |0)®|1), |l)®|+>, |1>®|-), (17) 
where |±) — (|0) ± |1))/V2. The Kraus matrices are 

A^,=E^S,o. (18) 
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From the properties of complete orthogonal measure- 
ments (Beckman et at, 2001), it follows that this opera- 
tion cannot be performed without Alice talking to Bob. 
A protocol to realize this measurement is the following. 
Alice measures her qubit in the basis {|0), |1)}, and tells 
her result to Bob. If Alice's outcome was |0), Bob mea- 
sures his qubit in the basis {|0), |1)}, and if it was |1), in 
the basis {|+>, |-)}. 

Beckman et al. (2001) derived necessary and sufficient 
conditions to check the semicausality (and therefore, the 
causality) of PVM measurements. Groisman and Reznik 
(2002) allowed for more complicated conditional state 
evolutions. In particular, they were interested in verifica- 
tion measurements, i.e., those yielding /i with certainty 
if the state prior to the classical intervention is p oc 
but without making any specific demand on the resulting 
state p'^. They showed that all PVM verification mea- 
surements on 2 X 2 dimensional systems are localizable. 

Vaidman (2003) proposed a realization of verification 
measurements by means of a shared entangled ancilla, 
and Bell-type measurements by one of the parties. A 
verification measurement of the states in Eq. (|17|l will 
illustrate his construction. Alice and Bob share a Bell 



where the four Bell states are given by Eqs. (|13|l and 
l)14|l . and the symbol |^*^^)) means the state \^) rotated 
by TT around the 2;-axis, etc. Thus, the Bell measurement 
performed on the two particles at Alice's site leads to 
one of the branches of the superposition on the rhs of 
Eq. 1)19(1 . To complete the teleportation. Bob performs 
a rotation by tt around one of the axes according to the 
classical information he gets from Alice. 

Gauge theories also lead to interesting questions about 
measurability. Wilson loops, which are nonlocal objects 
by definition, are often invoked in their presentation (Pe- 
skin and Schroeder, 1995) and are the backbone of lattice 
gauge theories (Makeenko, 2002). Beckman et al. (2002) 
investigated the measurability of the Wilson loop opera- 
tors. 

The impossibility of instantaneous communication al- 
lows to circumvent the theoretical impossibility of quan- 
tum bit commitment (Mayers, 1997; Lo and Chau, 1997). 
Kent (1999, 2003) developed protocols based on the finite 
speed of communication and evaluated their communica- 
tion costs and security. In particular Kent's RBC2 proto- 
col allows a bit commitment to be indefinitely maintained 
with unconditional security against all classical attacks, 
and at least for some finite amount of time against quan- 
tum attacks (Kent, 2003). 



state \^~) and, contrary to the scheme of Beckman et al. 
(2001), they do not have to coordinate their moves. Alice 
and Bob perform their tasks independently and convey 
their results to a common center, where the final analy- 
sis is made. In the first step of this measurement, Alice 
performs a Bell measurement as in the teleportation of a 
state l^*) from her site to Bob (see below). However, Al- 
ice and Bob do not perform the full teleportation which 
requires a classical communication between them. The 
second step of the verification is executed by Bob. He 
measures the spin of his particle in the z direction. Ac- 
cording to whether that spin is up or down, he measures 
the spin of his ancilla in the z oi x direction, respectively. 
This completes the measurement and it only remains to 
combine the local outcomes to get the result of the non- 
local measurement (Vaidman, 2003). This method can 
be extended to arbitrary Hilbert space dimensions. 

In the teleportation of an unknown state \^){) of a 
spin-i particle located at Alice's site, Alice and Bob use 
a prearranged pair in a singlet state, namely |\l/~)i2 = 
(|0)i|l)2 - |l)i|0)2)/V2. The procedure is based on the 
identity (Bennett et al., 1993) 



(19) 

I 

III. THE RELATIVISTIC MEASURING PROCESS 
A. General properties 

Quantum measurements are usually considered as 
quasi-instantaneous processes. In particular, they affect 
the wave function instantaneously throughout the en- 
tire configuration space. Measurements of finite duration 
(Peres and Wootters, 1985) make no essential difference 
in this respect. Is this quasi-instantaneous change of the 
quantum state, caused by a local intervention of an ex- 
ophysical agent, consistent with relativity theory? The 
answer is not obvious. The wave function itself is not 
a material object forbidden to travel faster than light, 
but we may still ask how the dynamical evolution of an 
extended quantum system that undergoes several mea- 
surements in distant spacetime regions is described in 
different Lorentz frames. 

Difficulties were pointed out long ago by Bloch (1967), 
Aharonov and Albert (1981, 1984), and many others 
(Peres, 1995 and references therein). Still before them, 
in the very early years of quantum mechanics, Bohr and 
Rosenfeld (1933) had given a complete relativistic theory 
of the measurement of quantum fields, but these authors 
were not concerned about the properties of the new quan- 
tum states that resulted from these measurements and 
their work does not answer the question that was raised 



I 

|*)o|*")l2 = \ (|*-)0l|*)2 + |* + )0l|*^'^)2 + |$-)0l|*("^)2 + |$+)0l|*^^^)2) , 
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above. Other authors (Scarani et ai, 2000; Zbinden et 
ai, 2001) considered detectors in relative motion, and 
therefore at rest in different Lorentz frames. These works 
also do not give an explicit answer to the above ques- 
tion: a detector in uniform motion is just as good as one 
that has undergone an ordinary spatial rotation. (Ac- 
celerated detectors involve new physical phenomena, see 
Sec. IV. Dl ) The point is not how individual detectors 
happen to move, but how the effects due to these detec- 
tors are described in different ways in one Lorentz frame 
or another. 

To become fully relativistic, the notion of intervention 
requires some refinement. The precise location of an in- 
tervention, which is important in a relativistic discussion, 
is the point from which classical information is sent that 
may affect the input of other interventions. More pre- 
cisely, it is the earliest small region of spacetime from 
which classical information could have been sent. More- 
over, in the conventional presentation of non-relativistic 
quantum mechanics, each intervention has a (finite) num- 
ber of outcomes, for example, this or that detector clicks. 
In a relativistic treatment, the spatial separation of the 
detectors is essential and each detector corresponds to a 
different intervention. The reason is that if several de- 
tectors are set up so that they act at a given time in one 
Lorentz frame, they would act at different times in an- 
other Lorentz frame. However, a knowledge of the time 
ordering of events is essential in our dynamical calcula- 
tions, so that we want the parameters of an intervention 
to refer unambiguously to only one time (indeed to only 
one spacetime "point"). Therefore, an intervention can 
involve only one detector and it can have only two possi- 
ble outcomes: either there was a "click" or there wasn't. 

What is the role of relativity theory here? We may 
likewise ask what is the role of translation and/or rota- 
tion invariance in a nonrelativistic theory. The point is 
that the rules for computing quantum probabilities in- 
volve explicitly the spacetime coordinates of the inter- 
ventions. Lorentz invariance (or rotational invariance, as 
a special case) says that if the classical spacetime coordi- 
nates are subjected to a particular linear transformation, 
then the probabilities remain the same. This invariance 
is not trivial because the rule for computing the proba- 
bility of occurrence of a given record involves a sequence 
of mathematical operations corresponding to the time or- 
dered set of all the relevant interventions. 

If we only consider the Euclidean group, all we have 
to know is how to transform the classical parameters, 
and the wave function, and the various operators, under 
translations and rotations of the coordinates. However, 
when we consider genuine Lorentz transformations, we 
have not only to Lorentz-transform the above symbols, 
but we are faced with a new problem: the natural way 
of calculating the result of a sequence of interventions, 
namely by considering them in chronological order, is dif- 
ferent for different inertial frames. The issue is not only a 
matter of covariance of the symbols at each intervention 
and between consecutive interventions. There are gen- 



uinely different prescriptions for choosing the sequence 
of mathematical operations in our calculation. Therefore 
these different orderings ought to give the same set of 
probabilities, and this demand is not trivial. 




FIG. 1 In this spacetime diagram, the origins of the coordi- 
nate systems are the locations of the two tests. The ti and 
t2 axes are the world lines of the observers, who are receding 
from each other. In each Lorentz frame, the zi and Z2 axes 
are isochronous: ti — and t2 — 0, respectively. 



B. The role of relativity 

A typical example of relativistic measurement is the 
detection system in the experimental facility of a mod- 
ern high energy accelerator. Following a high energy col- 
lision, thousands of detection events occur in locations 
that may be mutually space-like. Yet, some of the detec- 
tion events are mutually time-like, for example when the 
world line of a charged particle is recorded in an array 
of wire chambers. In a relativistic context, the term "de- 
tector" strictly means an elementary detecting element, 
such as a bubble in a bubble chamber, or a small segment 
of wire in a wire chamber. 

A much simpler example of space-like separated in- 
terventions, which is amenable to a complete analysis, 
is Bohm's version of the Einstcin-Podolsky-Rosen "para- 
dox" (hereafter EPRB; Einstein, Podolsky, and Rosen, 
1935; Bohm 1951) which is sketched in Fig. 1, with two 
coordinate systems in relative motion (Peres, 1993). In 
that experiment, a pair of spin-i particles, prepared in 
a singlet state, move apart and are detected by two ob- 
servers. Each observer measures a spin component along 
an arbitrarily chosen direction. The two interventions are 



^ High energy physicists use a different language. For them, an 
"event" is one high energy collision together with all the subse- 
quent detections that are recorded. This "event" is what we call 
here an experiment (while they call "experiment" the complete 
experimental setup that may be run for many months). And 
their "detector" is a huge machine weighing thousands of tons. 
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mutually space-like as shown in the figure. The test of 
Six occurs first when recorded in ti-time, and the test of 
S2y is the first one in t2-tinie. The evolution of the quan- 
tum state of this bipartite system appears to be genuinely 
different when recorded in two Lorentz frames in relative 
motion. The quantum states are not Lorentz-transforms 
of each other. Yet, all the observable results are the same. 
Consistency of the theoretical formalism imposes definite 
relationships between the various operators used in the 
calculations (Peres, 2000b). In particular, it is sufficient 
for consistency that the Kraus operators satisfy equal- 
time commutation relation as in Eq. I^. The analogy 
with relativistic quantum field theory is manifest. 

In general, consider the quantum evolution from an 
initial state po to a final state pf. It is a completely 
positive map, 

P/=^A„poAt. (20) 

n 

The Lorentz transformation of the Kraus matrices An 
can be obtained as follows. We have pg — UpoU^ and 
p'j: = V pfV\ where U and V are unitary representations 
of Lorentz transformations for the systems represented 
by Po and p / (which may be of different nature and even 
of different dimensions). 

Lorentz invariance means that, in another frame, the 
Kraus matrices A'^ satisfy 

p} = ^A:,p^A;t. (21) 

n 

A simple solution is 

A'n = VAnU\ (22) 
but this is not the most general one. The latter is 

A'n^Y.^nVA„,U\ (23) 

m 

where is a unitary matrix that acts on the labels m, n 
(not on the Hilbert spaces of po a-nd pf). This arbitrari- 
ness is a kind of gauge freedom, and can be resolved only 
by a complete dynamical description of the intervention 
process. This, however, is an arduous problem. Rela- 
tivistic interactions necessarily involve field theory, and 
the question is how to generalize the quantum informa- 
tion tools (POVMs, completely positive maps) into ob- 
jects that are described by quantum field theories (Terno 
2002). 

At this stage we consider only field theories in 
Minkowski spacetime where a unique vacuum state |f2) 
is defined. The discrete indices that appear in the above 
equations can still be used, owing to the fact that the un- 
derlying Hilbert space is separable (Streater and Wight- 
man, 1964). Therefore the formalism is valid without 
change in the relativistic domain. However, not every 



The fact that the values of classical parameters ("measurable 



measurement-induced state transformation that can be 
written in the Kraus form is permitted or makes sense. 
Relativity theory prohibits superluminal velocity for ma- 
terial objects. Consistency with the requirements of co- 
variance and causality is an intrinsic feature of quantum 
field theories. Nevertheless, to make problems solvable, 
a patchwork of relativistic and non-relativistic theories 
is employed. For example, a measurement on relativis- 
tic systems is usually treated by introducing detectors 
that are described by non-relativistic quantum mechan- 
ics. Often these detectors are stripped to only a few dis- 
crete degrees of freedom (Unruh and Wald, 1984; Levin, 
Peleg and Peres, 1992; Wald, 1994). 

An external probe which is not described by field the- 
ory and whose coupling to the fields of interest is arbitrar- 
ily adjustable is obviously an idealization. Beckman et 
al. (2001) assert that if the probe variables are "heavy," 
with rapidly decaying correlations and the field variables 
are "light," then this idealization is credible. Still, causal- 
ity requirements like the absence of signalling should be 
checked for any proposed measurement scheme fSec. lH.ij!] 
also discusses causality requirements). 

Consider again the descriptions of the EPRB 
gedankenexperiment in two coordinate systems in rela- 
tive motion. There exists a Lorentz transformation con- 
necting the initial states po and pg before the two inter- 
ventions, and likewise there is a Lorentz transformation 
connecting the final states p/ and p'j after completion 
of the two interventions. On the other hand, there is 
no Lorentz transformation relating the states at inter- 
mediate times that are not in the past or future of both 
interventions (Peres, 2000b). The various Kraus opera- 
tors, acting at different times, appear in different orders. 
Nevertheless the overall transition from initial to final 
state is Lorentz invariant (Peres, 2001). 

In the time interval between the two interventions, 
nothing actually happens in the real world. It is only 
in our mathematical calculations that there is a deter- 
ministic evolution of the state of the quantum system. 
This evolution is not a physical process. ^'^ What distin- 
guishes the intermediate evolution between interventions 
from the one occurring at an intervention is the unpre- 
dictability of the outcome of the latter: either there is 
a click or there is no click of the detector. This un- 
predictable macroscopic event starts a new chapter in 



quantities" ) are finite real numbers is sufficient to construct prob- 
ability measures. For the exact formulation see Davies (1976) 
and Holevo (1982). Similar arguments justify the inclusion of 
only bounded operators into algebras of local observables (Haag, 
1996; Araki, 1999). 

Likewise, the quantum state of Schrodinger's legendary cat, 
doomed to be killed by an automatic device triggered by the de- 
cay of a radioactive atom, evolves into a superposition of "live" 
and "dead" states. This is a manifestly absurd situation for a 
real cat. The only meaning that such a quantum state can have 
is that of a mathematical tool for statistical predictions on the 
fates of numerous cats subjected to the same cruel experiment. 
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the history of the quantum system which acquires a new 
state, according to Eq. 



C. Quantum nonlocality? 

Phenomena like those illustrated in Fig. 1 are often at- 
tributed to "quantum nonlocality" and have led some 
authors to speculate on the possibility of superlumi- 
nal communication (actually, instantaneous communica- 
tion). One of these proposals (Herbert, 1981) looked rea- 
sonably serious and arose enough interest to lead to inves- 
tigations disproving this possibility (Glauber, 1986) and 
in particular to the discovery of the no-cloning theorem 
(Wootters and Zurek, 1982; Dieks, 1982). Let us examine 
more closely the origin of these claims of nonlocality. 

Bell's theorem (1964) asserts that it is impossible to 
mimic quantum theory by introducing a set of objec- 
tive local "hidden" variables. It follows that any classical 
imitation of quantum mechanics is necessarily nonlocal. 
However Bell's theorem does not imply the existence of 
any nonlocality in quantum theory itself. In particular 
relativistic quantum field theory is manifestly local. The 
simple and obvious fact is that information has to be 
carried by material objects, quantized or not. Therefore 
quantum measurements do not allow any information to 
be transmitted faster than the characteristic velocity that 
appears in the Green's functions of the particles emitted 
in the experiment. In a Lorcntz invariant theory, this 
limit is the velocity of light. 

In summary, relativistic causality cannot be violated 
by quantum measurements. The only physical assump- 
tion that is needed to prove this assertion is that Lorentz 
transformations of the spacetime coordinates are imple- 
mented in quantum theory by unitary transformations of 
the various operators. This is the same as saying that the 
Lorentz group is a valid symmetry of the physical system 
(Weinberg, 1995). 



D. Classical analogies 

Are relativity and quantum theory really involved in 
these issues? The matter of information transfer by 
means of distant measurements is essentially nonrela- 
tivistic. Replace "superluminal" by "supersonic" and 
the argument is exactly the same. The maximal speed 
of communication is determined by the dynamical laws 
that govern the physical infrastructure. In quantum field 
theory, the field excitations are called "particles" and 
their speed over macroscopic distances cannot exceed the 
speed of light. In condensed matter physics, linear exci- 
tations are called phonons and the maximal speed is that 
of sound. 

As to the EPRB setup, consider an analogous classi- 
cal situation: a bomb, initially at rest, explodes into two 
fragments carrying opposite angular momenta. Alice and 
Bob, far away from each other, measure arbitrarily cho- 



sen components of Ji and J2. (They can measure all 
the components, since these have objective values.) Yet, 
Bob's measurement tells him nothing of what Alice did, 
nor even whether she did anything at all. He can only 
know with certainty what would be the result found by 
Alice if she measures her J along the same direction as 
him, and make statistical inferences for other possible 
directions of Alice's measurement. 

The classical-quantum analogy becomes complete if we 
use classical statistical mechanics. The distribution of 
bomb fragments is given by a Liouville function in phase 
space. When Alice measures Ji, the Liouville function 
for J2 is instantly altered, however far Bob is from Al- 
ice. No one finds this surprising, since it is universally 
agreed that a Liouville function is only a mathematical 
tool representing our statistical knowledge. Likewise, the 
wave function "0, or the corresponding Wigner function 
(Wigner, 1932) which is the quantum analogue of a Li- 
ouville function, are no more than mathematical tools 
for computing probabilities. It is only when they are re- 
garded as physical objects that superluminal paradoxes 
arise. 

The essential difference between the classical and quan- 
tum functions which change instantaneously as the result 
of measurements is that the classical Liouville function 
is attached to objective properties that are only imper- 
fectly known. On the other hand, in the quantum case, 
the probabilities are attached to potential outcomes of 
mutually incompatible experiments, and these outcomes 
do not exist "out there" without the actual interventions. 
Unperformed experiments have no results. 

IV. QUANTUM ENTROPY AND SPECIAL RELATIVITY 

A. Reduced density matrices 

In our discussion of the measuring process, decoherence 
was attributed to the unability of accounting explicitly 
for the degrees of freedom of the environment. The envi- 
ronment thus behaves an exosystem (Finkelstein, 1988) 
and the system of interest is "open" because parts of the 
universe are excluded from its description. 

This leads to the introduction of reduced density ma- 
trices: let us use Latin indices for the description of the 
exosystem (that is, if we were able to give it a description) 
and Greek indices for the subsystem that we can actually 
describe. The components of a state vector would thus 
be written Vm^ and those of a density matrix Pmfi.m- 
The reduced density matrix of the system of interest is 
given by 

m 

Even if p is a pure state (a matrix of rank one), r is in 
general a mixed state. Its entropy is defined as 

5 = -tr(rlogT). (25) 
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In a relativistic system, whatever is outside the past 
hght cone of the observer is unknown to him, but also 
cannot affect his system, therefore does not lead to de- 
coherence (here, we assume that no particle emitted by 
an exosystem located outside the past cone penetrates 
into the future cone.) Since observers located at different 
points have different past light cones, they exclude from 
their descriptions different parts of spacetime. Therefore 
any transformation law between them must tacitly as- 
sume that the part excluded by one observer is irrelevant 
to the system of the other observer. 

Another consequence of relativity is that there is a hi- 
erarchy of dynamical variables: primary variables have 
relativistic transformation laws that depend only on the 
Lorentz transformation matrix A that acts on the space- 
time coordinates. For example, momentum components 
are primary variables. On the other hand, secondary 
variables such as spin and polarization have transforma- 
tion laws that depend not only on A, but also on the mo- 
mentum of the particle. As a consequence, the reduced 
density matrix for secondary variables, which may be well 
defined in any coordinate system, has no transformation 
law relating its values in different Lorentz frames. A sim- 
ple example is given in Sec. IIV.BI Appendix A gives a 
summary of the relativistic state transformations for free 
particles. 

Moreover, an unambiguous definition of the reduced 
density matrix by means of Eq. H24|l is possible only if the 
secondary variables are unconstrained. For gauge field 
theories, that equation may be meaningless if it conflicts 
with constraints imposed on the physical states (Beck- 
man et ai, 2002; Peres and Terno, 2003). In the absence 
of a general prescription, a case-by-case treatment is re- 
quired. A particular construction, valid with respect to a 
certain class of tests, is given in Sec. lIV.Cl A general way 
of defining reduced density matrices for physical states in 
gauge theories is an open problem. 



B. Massive particles 

We first consider the relativistic properties of the spin 
entropy for a single, free particle of spin ^ and mass 
m > 0. We shall show that the usual definition of quan- 
tum entropy has no invariant meaning. The reason is 
that under a Lorentz boost, the spin undergoes a Wigner 
rotation (Wigner, 1939; Halpern, 1968) whose direction 
and magnitude depend on the momentum of the particle. 
Even if the initial state is a direct product of a function 
of momentum and a function of spin, the transformed 
state is not a direct product. Spin and momentum ap- 
pear to be entangled. (This is not the familiar type of 
entanglement which can be used for quantum communi- 
cation, because both degrees of freedom belong to the 
same particle, not to distinct subsystems that could be 
widely separated.) 

The quantum state of a spin-i particle can be written, 
in the momentum representation, as a two-component 



spmor. 



V'(P) = 



ai(p) 
a2(p) 



(26) 



where the amplitudes satisfy J2r I \'^r{p)\'^dp = 1. 
The normalization of these amplitudes is a matter of con- 
venience, depending on whether we prefer to include a 
factor pq — {m? -\- p^)^/^ in it, or to have such factors in 
the transformation law ()29|1 below. Following Halpern 
(1968), we shall use the second alternative, because it 
is closer to the nonrelativistic notation which appears in 
the usual definition of entropy. In this section, we use 
natural units: c = 1. 

Here we emphasize that we consider normalizable 
states, in the momentum representation, not momen- 
tum eigenstates as usual in textbooks on particle physics. 
The latter are chiefly concerned with the computation of 
(injout) matrix elements needed to obtain cross sections 
and other asymptotic properties. However, in general 
a particle has no definite momentum. For example, if 
an electron is elastically scattered by some target, the 
electron state after the scattering is a superposition that 
involves momenta in all directions. 

In that case, it still is formally possible to ask, in any 
Lorentz frame, what is the value of a spin component in a 
given direction (this is a legitimate Hermitian operator). 
In quantum information theory, the important issue does 
not reside in asymptotic properties, but how entangle- 
ment (a communication resource) is defined by different 
observers. Early papers on this subject used momentum 
eigenstates, just as in particle physics (Czachor, 1997). 
However, radically new properties arise when localized 
quantum states are considered. 

Let us define a reduced density matrix, r = 
J o?p'i/'(p)V'^(p)i giving statistical predictions for the re- 
sults of measurements of spin components by an ideal 
apparatus which is not affected by the momentum of the 
particle. The spin entropy is 



-tr (t logr) 



^AjlogAj, 



(27) 



where \j are the eigenvalues of t. 

As usual, ignoring some degrees of freedom leaves the 
others in a mixed state. What is not obvious is that in 
the present case the amount of mixing depends on the 
Lorentz frame used by the observer. Indeed consider an- 
other observer (Bob) who moves with a constant velocity 
with respect to Alice who prepared state H26II . In the 
Lorentz frame where Bob is at rest, the same spin-i par- 
ticle has a state 



V''(P) 



«i(p) 
a2(p) 



(28) 



The transformation law is (Weinberg, 1995) 



a'(p) = [{k-'p)o/poY'^ ^i?.,[A,(A-V)] a.(A-V), 

(29) 
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where Drs is the Wigner rotation matrix for a Lorentz 
transformation A. Further details of this transformation 
and its representation by a quantum circuit are given in 
Appendix A. 

As an example, take a particle prepared by Alice with 
spin in the z direction, so that a2(p) = 0. Spin and 
momentum are not entangled, and the spin entropy is 
zero. When that particle is described in Bob's Lorentz 
frame, moving with velocity /3 in a direction at an angle 9 
with Alice's z-axis, a detailed calculation shows that both 
a'l and a'2 are nonzero, so that the spin entropy is positive 
(Peres, Scudo, and Terno, 2002). This phenomenon is 
illustrated in Fig. 2. A relevant parameter, apart from 
the angle 9, is, in the leading order in momentum spread, 

r = ^ " , 30 

TO p 

where A is the momentum spread in Alice's frame. The 
entropy has no invariant meaning, because the reduced 
density matrix r has no covariant transformation law, 
except in the limiting case of sharp momenta. Only the 
complete density matrix transforms covariantly. 

How is the linearity of the transformation laws lost in 
this purely quantum mechanical problem? The momenta 
p do transform linearly, but the law of transformation 
of spin depends explicitly on p. When we evaluate r 
by summing over momenta in p, all knowledge of these 
momenta is lost and it is then impossible to obtain r' by 
transforming r. Not only is linearity lost, but the result 
is not nonlinearity in the usual sense of this term. It 
is the absence of any definite transformation law which 
depends only on the Lorentz matrix. 

It is noteworthy that a similar situation arises for a 
classical system whose state is given in any Lorentz frame 
by a Liouville function (Balescu and Kotera, 1967). Re- 
call that a Liouville function expresses our probabilistic 
description of a classical system — what we can pre- 
dict before we perform an actual observation — just as 
a quantum state is a mathematical expression used for 
computing probabilities of events. 

To avoid any misunderstanding, we emphasize that 
there is no consistent relativistic statistical mechanics 
for interacting particles, with a 6A^-dimensional phase 
space defined by the canonical coordinates p„ and q„ 
(n = 1, . . . , N). Any relativistic interaction must be me- 
diated by fields, having an infinity of degrees of freedom. 
A complete Liouville function, or rather Liouville func- 
tional, must therefore contain not only all the canonical 
variables p„ and q„, but also all the fields. However, once 
this Liouville functional is known (in principle), we can 
define from it a reduced Liouville function, by integrating 
the functional over all the degrees of freedom of the fields. 
The result is a function of p„ and q„ only (just as we 
compute reduced density matrices in quantum theory). 
The time evolution of such reduced Liouville functions 
cannot be obtained directly from canonical Hamiltonian 
dynamics without explicitly mentioning the fields. These 
functions are well defined in any Lorentz frame, but they 



have no relativistic transformation law. Only the com- 
plete Liouville functional, including the fields, has one. 

Consider now a pair of orthogonal states that were 
prepared by Alice. How well can moving Bob distinguish 
them, if he is restricted to measuring discrete degrees of 
freedom? We shall use the simplest criterion, namely the 
probability of error Pe, defined as follows: an observer 
receives a single copy of one of the two known states and 
performs any operation permitted by quantum theory in 
order to decide which state was supplied. The probability 
of a wrong answer for an optimal measurement is (Fuchs 
and van de Graaf, 1999) 

PB(pi,P2) = i + iW(Pl-P2)2. (31) 

In Alice's frame Pe = 0. It can be shown that in Bob's 
frame, P'^ (x F^, where the proportionality factor de- 
pends on the angle 9 defined above. Of course, the op- 
posite Lorentz transformation induces a change from a 
positive Pe in Bob's frame to P^^O in Alice's frame. 
We discuss the resulting effective quantum channel in 
Sec. irvTEl 




FIG. 2 Dependence of the spin entropy S, in Bob's frame, 
on the values of the angle 6 and a parameter F ~ [1 — (1 — 
Z?^)^/^] A/m/3, where A is the momentum spread in Alice's 
frame. 



C. Photons 

The long range propagation of polarized photons is 
an essential tool of quantum cryptography (Gisin et al., 
2002). Usually, optical fibers are used, and the photons 
may be absorbed or depolarized due to imperfections. In 
some cases, such as communication with space stations, 
the photons propagate in vacuo (Buttler et al, 2000). 
The beam then has a finite diffraction angle of order 
A/a, where a is the aperture size, and new deleterious 
effects appear. In particular a polarization detector can- 
not be rigorously perpendicular to the wave vector and 
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the transmission is never faithful, even with perfect de- 
tectors. Moreover, this "vacuum noise" depends on the 
relative motion of the observer with respect to the source. 

These relativistic effects are essentially different from 
those for massive particles that were discussed above, 
because photons have only two linearly independent po- 
larization states. The properties that we discuss are 
kinematical, not dynamical. At the statistical level, it 
is not even necessary to involve quantum electrodynam- 
ics. Most formulas can be derived by elementary clas- 
sical methods (Peres and Terno, 2003). It is only when 
we consider individual photons, for cryptographic appli- 
cations, that quantum theory becomes essential. The 
diffraction effects mentioned above lead to superselection 
rules which make it impossible to define a reduced den- 
sity matrix for polarization. As shown below, it is still 
possible to have "effective" density matrices; however, 
the latter depend not only on the preparation process, 
but also on the method of detection that is used by the 
observer. 

Assume for simplicity that the electromagnetic sig- 
nal is monochromatic. In a Fourier decomposition, the 
Cartesian components of the wave vector (with /i = 
0, 1, 2, 3) can be written in term of polar angles: 

— (1, sin cos (/), sin6' sin0, cos 0), (32) 

where we use units such that c = 1 and k^) — \. Let us 
choose the z axis so that a well collimated beam has a 
large amplitude only for small 9. 

In a real experiment, the angles 9 and are distributed 
in a continuous way around the z axis (exactly how de- 
pends on the properties of the laser) and one has to take 
a suitable average over them. As the definition of polar- 
ization explicitly depends on the direction of k, taking 
the average over many values of k leads to an impure 
polarization and may cause transmission errors. 

Let us consider the effect of a motion of the detec- 
tor relative to the emitter, with a constant velocity v = 
(0,0, u). The Lorentz transformation of k^ in Eq. (32) 
yields new components 

/sq = 7(1 — t;cos6') and k'^ ~ ^[cos9 — v), (33) 

where 7 = (1 — z;^)~^/^. Considering again a single 
Fourier component, we have, instead of the unit vector 
k, a new unit vector 

^,^( sing ^o^^^ii^y (34) 
\7(1 — wcost^) 1 — wcosfc'/ 

In other words, there is a new tilt angle 9' given by 

sing' sin 6*77(1 - u cos 6*). (35) 

For small 9, such that 9^ <C \v\, we have 

O'^e^^. (36) 



The square root is the familiar relativistic Doppler factor. 
For large negative v, the diffraction angle becomes arbi- 
trarily small, and sideway losses (which are proportional 
to g'^) can be reduced to zero. 

It is noteworthy that the same Doppler factor was 
obtained by Jarett and Cover (1981) who considered 
only the relativistic transformations of bit rate and noise 
intensity, without any specific physical model. This 
remarkable agreement shows that information theory 
should properly be considered as a branch of physics. 

In applications to secure communication, the ideal sce- 
nario is that isolated photons (single particle Fock states) 
are emitted. In a more realistic setup, the transmission 
is by means of weak coherent pulses containing on the 
average less than one photon each. A basis of the one- 
photon space is spanned by states of definite momentum 
and helicity, 

|k,e±)^|k)®|e±), (37) 

where the momentum basis is normalized by (q|k) = 
(27r)3(2fcO)5(3)(q _ k), and helicity states |e^) are ex- 
plicitly defined by Eq. H4U|) below. 

As we know, polarization is a secondary variable: 
states that correspond to different momenta belong to 
distinct Hilbert spaces and cannot be superposed (an ex- 
pression such as je^ ) + l^q ) meaningless if k ^ q). The 
complete basis (I37II does not violate this superselection 
rule, owing to the othogonality of the momentum basis. 
Therefore, a generic one-photon state is given by a wave 
packet 

^ j d/,(k)/(k)|k,a(k)). (38) 

The Lorentz-invariant measure is dfi(k) — (i'^k/(27r)'^2/c'^, 
and normalized states satisfy J dfi{k)\f{'k)\'^ = 1. The 
generic polarization state |Q!(k)) corresponds to the geo- 
metrical 3-vector 

a(k) -a+(k)e++a_(k)ek, (39) 

where ja+P + = 1, and the explicit form of is 

given below. 

Lorentz transformations of quantum states are most 
easily computed by referring to some standard momen- 
tum, which for photons is p'^ — (1,0,0, 1). Accordingly, 
standard right and left circular polarization vectors are 
= (1, ±z, 0)/a/2. For linear polarization, we take 
Eq. H39|) with a+ = (<^-)*: so that the 3-vectors a.(k) 
are real. In general, complex Q:(k) correspond to elliptic 
polarization. 

Under a Lorentz transformation A, these states be- 
come |kA, Q:(kA)), where kA is the spatial part of a four- 
vector k\ = Afc, and the new polarization vector can be 
obtained by an appropriate rotation given by Eq. 1421) 
below. For each k a polarization basis consists of the 
helicity vectors, 

e± = i?(k)e± (40) 
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and the corresponding quantum states are |k, ej). 

As usual, k denotes the unit 3-vector in the direc- 
tion of k. The standard matrix (Weinberg, 1995) 
that rotates the standard direction (0, 0, 1) to k = 
(sin 6 cos (f>, sin 9 sin 4>, cos 9) is 



cos a cos ( 
_R(k) — cosflsint; 

— sin 9 



— sm (p cos <p sm ( 
cos (f> sin (f) sin t 
cos 6* 



(41) 



and Hkewise for kA- 

Under a general Lorentz transformation, be it a rota- 
tion or a boost, helicity is preserved, but quantum states 
and the corresponding geometric vectors acquire helicity- 
dependent phases (see Appendix A for more details): 



kA 



-»?(A,k) 



(42) 



where the explicit expressions for ^(A, k) are given by 
Lindner, Peres, and Terno (2003) and Bergou, Gingrich 
and Adami (2003). 

The superselection rule that was mentioned above 
makes it impossible to define a reduced density matrix 
in the usual way (Peres and Terno, 2003; Lindner, Peres 
and Terno, 2003). We can however define an "effective" 
reduced density matrix for polarization, as follows. The 
labelling of polarization states by Euclidean vectors ejj, 
and the fact that photons are spin-1 particles, suggest 
the use of a 3 x 3 matrix with entries labelled x, y and z. 
Classically, they correspond to different directions of the 
electric field. For example, when k = z, only p^^, p^y, 
Pyy are non-zero. For a generic photon state I^I/), let us 
try to construct a reduced density matrix p^x that gives 
the expectation value of an operator representing the po- 
larization in the x direction, irrespective of the particle's 
momentum. 

To have a momentum-independent polarization is to 
tacitly admit longitudinal photons. Unphysical concepts 
are often used in intermediate steps in theoretical physics. 
Momentum-independent polarization states thus consist 
of physical (transversal) and unphysical (longitudinal) 
parts, the latter corresponding to a polarization vector 
= k. For example, a generalized polarization state 
along the x-axis is 

\±)^x+{k)\e+)+x^{k)\e^)+xe{k)\ei), (43) 

where x±(k) = • x, and xi(k) = x • k = sin cos It 
follows that |a;+p -I- -I- |a;^P = 1, and we thus define 



exik) 



a;+(k)< 



_(k). 



(44) 



as the polarization vector associated with the x direction. 
It follows from (|43|l that (x|x) ~ 1 and (x|y) = x-y = 0, 
and likewise for other directions, so that 



To the direction x corresponds a projection operator 

Px^\±) {±\ ® Ip = |x) (i| ® J dn(k)\k) (k| , (46) 

where Ip is the unit operator in momentum space. The 
action of P^; on follows from Eq. and (e^ |e^) = 0. 
Only the transversal part of |x) appears in the expecta- 
tion value: 

{^\Px\^)= I d,i(k)\f{k)f\x+ik)a*,ik)+x^ik)a*_ik)f. 

(47) 

It is convenient to write the transversal part of |x) as 

|b.(k)) ^ (I4)(e^l + Iek>(ekl)|x), (48) 
= x+ik)\e+)+x^{k)\e^). (49) 

Likewise define |bj,(k)) and |b2(k)). These three state 
vectors are neither of unit length nor mutually orthogo- 
nal. For k = (sin 9 cos 0, sin 9 sin (j), cos 9) we have 

|ba;(k)) = ^(cos^cosc?!) -I- isin0)|e^) + (50) 
^(cos6'cos0 - isin(/))|ej^) = c{9,(p)\k,ex{k)), (51) 

where e^(k) is given by Eq. H44|l . and c{9,(p) = 



Finally, a POVM element which is the physical 
part of Px , namely is equivalent to Px for physical states 
(without longitudinal photons) is 



dAi(k)|k,b,(k))(k,b,(k)|, 



(52) 



and likewise for the other directions. The operators Ex, 
Ey and E^ indeed form a POVM in the space of physical 
states, owing to Eq. H45|) . The above derivation was, ad- 
mittedly, a rather circuitous route for obtaining a POVM 
for polarization. This is due to the fact that the latter 
is a secondary variable, subject to superselection rules. 
Unfortunately, this is the generic situation. 

The entire effective density matrix is reconstructed us- 
ing techniques of Chuang and Nielsen (1997), and we get 
a simple expression for the reduced density matrix cor- 
responding to the polarization state |Q;(k)): 



d/.(k)|/(k)|2(a(k)|b™(k))(b„(k)|a(k)) (53) 



Pn 



It is interesting to note that this derivation gives a direct 
physical meaning to the naive definition of a reduced den- 
sity matrix, 



dM(k)|/(A:)|2a„,(kX(k)=p„ 



(54) 



|i)(x| + |y)(y| + |z)(z| = l. 



(45) 



Since polarization 3- vectors transform under rotations re- 
gardless of momentum, the effective 3x3 polarization 
density matrix has a standard transformation law under 
rotation R as well, p RpR^ . 
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Our basis states |k, Ck) are direct products of momen- 
tum and polarization. Owing to the transversality re- 
quirement ek • k = 0, they remain direct products un- 
der Lorentz transformations. All the other states have 
their polarization and momentum degrees of freedom en- 
tangled. As a result, if one is restricted to polarization 
measurements as described by the POVM elements ()52|l. 
there do not exist two orthogonal polarization states. It 
follows that photon polarization states cannot be cloned 
perfectly, because the no-cloning theorem (Wootters and 
Zurek, 1982; Dieks, 1982) forbids an exact copying of un- 
known non-orthogonal states. In general, any measure- 
ment procedure with finite momentum sensitivity will 
lead to the errors in identification. 

Our present problem is the distinguishability by our 
observer. Bob, of a pair of different quantum states that 
were prepared by Alice. The probability of an error by 
Bob is given by Eq. (|31(l . The distinguishability of polar- 
ization density matrices depends on the observer's mo- 
tion. We again assume that Bob moves along the z-axis 
with a velocity v. Let us calculate his reduced density 
matrix. Recall that reduced density matrices have no 
transformation law (only the complete density matrix has 
one) except in the limiting case of sharp momenta. To 
calculate Bob's reduced density matrix, we must trans- 
form the complete state, and only then take a partial 
trace. A detailed calculation (Peres and Terno, 2003) 
leads to 

Pe = \^Pe, (55) 

1 — V 

which may be either larger or smaller than Pe- As ex- 
pected, we obtain for one-photon states the same Doppler 
effect as in the classical equation (|36f) . 

D. Entanglement 

An important problem is the relativistic nature of 
quantum entanglement when there are several particles. 
For two particles, an invariant definition of the entangle- 
ment of their spins would be to compute it in the Lorentz 
"rest frame" where p) = 0. However, this simple def- 
inition is not adequate when there are more than two 
particles, because there appears a problem of cluster de- 
composition: each subset of particles may have a different 
rest frame. This is a difficult problem, still awaiting for 
a solution. We shall mention only a few partial results. 

First, we have to define a convenient measure of entan- 
glement. For two spin-i particles, the concurrence, C{p), 
is defined as follows (Wootters, 1998). Introduce a spin- 
fiipped state p = {ay (g) ay)p*{ay ® Uy). The concurrence 
is 

C(p) = max(0, Ai - A2 - A3 - A4), (56) 

where A^ are the eigenvalues, in decreasing order, of the 
Hermitian matrix [■^p^/pY^'^ ■ The larger the concur- 
rence, the stronger the entanglement: for maximally en- 



tangled states C = 1, while for non-entangled states 
C = 0. 

Alsing and Milburn (2002) considered bipartite states 
with well-defined momenta. They showed that while 
Lorentz transformations change the appearance of the 
state in different inertial frames and the spin directions 
are Wigner rotated, the amount of entanglement remains 
intact. The reason is that Lorentz boosts do not cre- 
ate spin-momentum entanglement when acting on eigen- 
states of momentum, and the effect of a boost on a pair 
is implemented on both particles by local unitary trans- 
formations, which are known to preserve entanglement. 
The same conclusion is valid for photon pairs. 

In particular, Hacyan (2001) showed that since the 
polarization angle remains constant in the polarization 
plane, the directions of perfect correlation for two pho- 
tons still exist in any reference frame, even if they are 
different from the laboratory directions. Terashima and 
Ueda (2003) showed that in a quite general setting for 
both massive and massless particles, allowing for relative 
motion, it is always possible to find directions of perfect 
(anti)correlations. 

However, realistic situations involve wave packets. For 
example, a state of two spin-^ particles is 

|TC"l2) = X! / ^A'(Pl)c^M(P2)5(criCr2,Pl,P2)|Pl,(Ti;P2,C^2), 

(57) 

where dp,{p) = d'^p/167r'^p° as usual. 

For typical particle beams, g is sharply peaked at some 
values pio, P20- Again, a boost to any Lorentz frame will 
result in a unitary C/(A) ® U{K) acting on each particle 
separately, thus preserving the entanglement. Neverthe- 
less, since boosts can change entanglement between dif- 
ferent degrees of freedom of each particle, the spin-spin 
entanglement is frame-dependent as well. 

Gingrich and Adami (2002) investigated the reduced 
density matrix for IT12) and made explicit calculations 
for the case where g is a Gaussian, as in the work of 
Peres, Scudo, and Terno (2002). They showed that if 
two particles are maximally entangled in a common, ap- 
proximate rest frame (Alice's frame), then C{p), as seen 
by a Lorentz-boosted Bob, decreases when the boost ve- 
locity tends to c. Of course, the inverse transformation 
from Bob to Alice will increase the concurrence. Thus, we 
see that that spin-spin entanglement is not a Lorentz in- 
variant quantity, exactly as spin entropy is not a Lorentz 
scalar. Relativistic properties of the polarization entan- 
glement we investigated by Bergou, Gingrich and Adami 
(2003). 

E. Communication channels 

Although reduced polarization density matrices have 
no general transformation rule, the above results show 
that such rules can be established for particular classes 
of experimental procedures. We can then ask how these 
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effective transformation rules, r' = T(t), fit into the 
framework of general state transformations. Are they 
completely positive (CP) as in Eq. 0? It can be proved 
that distinguishability, as expressed by natural measures 
like Pe, cannot be improved by any CP transforma- 
tion (Fuchs and van de Graaf, 1999). However, the CP 
requirement may fail if there is a prior entanglement 
with another system and the dynamics is not factorizable 
(Pechukas, 1994; Stelmachovic and Buzek, 2001; Salgado 
and Sanchez-Gomez, 2002). 

Since in Eq. H55|) and in the discussion following 
Eq. H31|) we have seen that distinguishability can be im- 
proved, we conclude that these transformations are not 
completely positive. The reason is that the Lorentz trans- 
formation acts not only on the "interesting" discrete vari- 
ables, but also on the primary momentum variables that 
we elected to ignore and to trace out, and its action on 
the interesting degrees of freedom depends on the "hid- 
den" primary ones. Of course, the complete state, with 
all the variables, transforms unitarily and distinguisha- 
bility is preserved. 

This technicality has one important consequence. In 
quantum information theory quantum channels are de- 
scribed by completely positive maps that act on qubit 
states (Holevo, 1999; Keyl, 2002). Qubits themselves are 
realized as discrete degrees of freedom of various parti- 
cles. If relativistic motion is important, then not only 
does the vacuum behave as a noisy quantum channel, 
but the very representation of a channel by a CP map 
fails. 



V. THE ROLE OF QUANTUM FIELD THEORY 

The POVM formalism is an essential tool of quantum 
information theory. Entanglement is a major resource 
for quantum communication and computation. In this 
section we present results of quantum field theory that 
are important for the relativistic generalization of these 
concepts. Mathematical results are stated in an informal 
way. Rigorous formulations and fine mathematical points 
can be found in the references that are supplied for each 
concept or theorem we introduce. 



A. General theorems 

First, we define the notions of local and quasi- local 
operators (Emch, 1972; Bogolubov et ai, 1990; Haag, 
1996; Araki, 1999). Local operators are associated with 
bounded regions of spacetime. For example, they may 
be field operators that are smeared with functions of 
bounded support (that is, functions that vanish if their 
argument is outside of a prescribed bounded region O 
of spacetime). Smeared renormalized stress-energy ten- 
sors also belong to this category. Quasi-local operators 
are obtained when the smearing functions have exponen- 
tially decaying tails. 



Theorem. The set of states A{0)\^1), generated from 
the vacuum \ by the (polynomial) algebra of operators 
in any bounded region, is dense in the Hilbert space of 
all field states. □ 

This is the Reeh-Schlieder theorem (Reeh and 
Schlieder, 1961; Streater and Wightman, 1964; Haag 
1996; Araki, 1999). It asserts that there are local op- 
erators Q G A{0) which, applied to the vacuum, pro- 
duce a state which is arbitrarily close to any arbitrary 
|T) (the vacuum state can be replaced by any state of 
finite energy). Thus in principle any entangled state can 
be arbitrarily closely approximated by suitable local op- 
erations on any other state. 

The theorem reveals a surprising amount of entangle- 
ment that is present in the vacuum state \ The corol- 
lary below shows that if a local operator is used to model 
a detector, that detector must have "dark counts" : it has 
a finite probability to "click" in a vacuum. 

Corollary. No operator that is localized in a bounded 
spacetime region annihilates the vacuum (nor any other 
physical state). □ 

Another important theorem is due to Epstein, Glaser 
and Jaffe (1965): 

Theorem. If a field Q{x) satisfies (^'|Q(x)|^') > for 
all states, and if {il\Q{x)\fl) — for the vacuum state, 
then Q{x) =0. □ 

This implies that no POVM constructed from local 
or quasi-local operators can have zero vacuum response. 
The theorem predicts for any local field Q{x) that has a 
zero vacuum expectation value, namely {n\Q{x)\fl) = 0, 
there exists a state for which the expectation value of 
Q{x) is negative. Further details can be found in the 
original article and in Tippler (1978). 

Another implication is a violation of the classical en- 
ergy conditions (Hawking and Ellis, 1973; Wald, 1984). 
Classically, energy density is always positive and the 
stress-energy tensor for all classical fields satisfies the 
weak energy condition (WEC) T^^u^^u'^ > 0, where is 
any timelike or null vector. The Epstein-Glaser- Jaffe the- 
orem shows that this is impossible for the renormalized 
stress-energy tensor of quantum field theories. Since it 
has by definition a null vacuum expectation value, there 
are states |T) such that (T|T^i,u^u'^|T) < 0. For exam- 
ple, squeezed states of the electromagnetic field (Man- 
del and Wolf, 1995), or the scalar field (Borde, Ford, 
and Roman, 2002), have locally negative energy densi- 
ties. The violation of WEC raises doubts on the use of 
energy density for the description of particle localization, 
as discussed in Sec. IV.BI 

While any entangled state can be approximated by the 
action of local operators on the clustering property 
of the vacuum^* asserts that states created by local oper- 
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ations, namely Q\^), Q € A{0), tend to look practically 
like a vacuum with respect to measurements in distant, 
causally unconnected regions. The behavior of detectors 
that are far away from each other is ruled by the fol- 
lowing theorems, where, for a local operator B E .4(0), 
we denote by i?x its translate by a spatial vector x, i.e., 
^ U{x)BW{x). 

Theorem, li A,B g A{0) are local operators and 
is the vacuum state, then 

{n\AB^\n) ^""-^ {n\A\n){n\B^\n). (58) 

There are estimates on the rate of convergence of the 
above expression as a function of the spacelike separa- 
tion for the cases of massive and massless particles. The 
asymptotic behavior depends on that of the Wightman 
function W{xi,X2) for \xi — a;2p —>■ oo (Streater and 
Wightman, 1964; Bogolubov et al. 1990; Haag, 1996). 

Theorem. If A e A{Oi) and B e A{02), where Oi 
and O2 are mutually spcelike regions with a spacelike 
separation r, then 

\{n\AB\n) - {n\A\n){n\B\n)\ (59) 

for a massless theory is bounded by 

f{0,,02,A,B)/r^, (60) 

where / is a certain function that depends on the regions 
and the operators, but not on the distance between the 
regions; for a massive theory it is bounded by 

e-"^^g{A,B), (61) 

where m is the relevant mass and g depends on the op- 
erators only. In this case 01,02 may be unbounded. □ 

The explicit derivation of the coefficients requires a 
more detailed treatment. Particular cases and values of 
numerical constants are given by Emch (1972), Freden- 
hagen (1985), Haag (1996), and Araki (1999). 

While it seems that vacuum correlations for massless 
fields decay much slower, the difference disappears if the 
finite sensitivity of detectors for soft photons is taken into 
account. It was shown by Summers and Werner (1987) 
that if a detector has an energy threshold e, the latter 
serves as an effective mass in correlation estimates, and 
an additional e"'^'' factor appears in Eq. H60|) . 

B. Particles and localization 

Classical interventions in quantum systems are local- 
ized in space and time. However, the principles of quan- 
tum mechanics and relativity dictate that this localiza- 
tion is only approximate. The notion of particles has 
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an operational meaning only owing to their localization: 
particles are what is registered by detectors. 

When quantum mechanics was a new science, most 
physicists wanted to preserve the notions with which 
they were familiar, and considered particles as real ob- 
jects having positions and momenta that were possibly 
unknown, and/or subject to an "uncertainty principle." 
Still, a few writers expressed critical opinions, for exam- 
ple ". . . no scheme of operations can determine experi- 
mentally whether physical quantities such as position and 
momentum exist. . . we get into a maze of contradictions 
as soon as we inject into quantum mechanics such con- 
cepts carried over from the language of our ancestors. . . " 
(Kemble, 1937). 

More recently, Haag (1996) wrote 

". . . it is not possible to assume that an elec- 
tron has, at a particular instant of time, any 
position in space; in other words, the concept 
of position at a given time is not a meaningful 
attribute of the electron. Rather, 'position' 
is an attribute of the interaction between the 
electron and a suitable detection device." 

We shall first briefly examine some aspects of the old 
fashioned approach to localization. First we note that 
even when we construct a local probability density (and, 
possibly, a corresponding current) it is impossible to in- 
terpret p(x, t)d^x as the probability to find a particle in 
the volume d^x at the space point x. It was argued by 
Landau and Peierls (1931) that a particle may be local- 
ized only with uncertainty Aa; > he/ (E), where (E) is the 
particle's expected energy. Intuitively, confinement of a 
particle to a narrower domain by "high walls" requires 
a very strong interaction which leads to pair production. 
Haag and Swieca (1965) have shown that restriction to a 
compact region of spacetime makes it impossible to de- 
tect with certainty any state. Hegerfeldt (1985) proved 
that if a one-particle POVM leads to probability distribu- 
tions such that the total probability of finding a particle 
outside a sphere of radius R at time t is bounded by 

Prob^fl < C2exp(-27i?), (62) 

where C is some constant and 7 > m, then at later times 
the probability distribution will spread faster than light. 
Furthermore, Giannitrapani (1998) and Toller (1999) 
proved that a spacetime localized POVM cannot be con- 
structed even from quasi-local operators. General discus- 
sions of localization from the point of view of algebraic 
quantum field theory can be found in the works of Buch- 
holz and Fredenhagen (1982), Roberts (1982), Neumann 
and Werner (1983), Werner (1986) and Haag (1996). 

Much earlier, Newton and Wigner (1949) had at- 
tempted to define a position operator, whose spectral de- 
composition (Wightman, 1962) gives a rough indication 
of the particle localization. However, it was shown by 
Rosenstein and Usher (1987) that Gaussian- like Newton- 
Wigner wave functions lead to superluminal propagation 
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of probability distributions. Busch (1999) reviewed the 
problems involved in the construction of POVMs for par- 
ticle localization. 

Energy density is directly related to photon local- 
ization in quantiun optics (Mandel and Wolf, 1995; 
Bialynicki-Birula, 1996). If the electrons in a detector 
interact with the electric field of light, then in a simple 
model the detection probability is proportional to the 
expectation value of the normal-ordered electric field in- 
tensity operator /(x, f) (Mandel, 1966), and the latter is 
proportional to the energy density. This probability dis- 
tribution decays asymptotically as the seventh power of 
distance, or even slower (Amrein, 1969). Despite it suc- 
cess in these examples, the notion of localization based 
on the energy density cannot have a universal validity, 
because the violation of WEC makes it unsuitable for 
the construction of POVMs. 

The real physical problem is how localized detectors 
can be. The idealization of "one detector per spacetime 
point" is obviously impossible. How can we manage to 
ensure that two detectors have zero probability to over- 
lap? There appears to be a fundamental trade-off be- 
tween detector reliability and localizability. The bottom 
line is how to formulate a relativistic interaction between 
a detector and the detected system. A true detector 
should be amenable to a dual quantum-classical descrip- 
tion, as in the Hay-Peres model (1998). This problem 
seems to be very far from a solution. Completely new 
notions may have to be invented. 

Although states with a definite number of particles are 
a useful theoretical concept, a look at quantum optics 
techniques or at the Table of Particle Properties shows 
that experimentally accesible quantum states are usually 
not eigenstates of partic;le rnimbcr operators. In general 
any process that is not explicitly forbidden by some con- 
servation law has a non-zero amplitude (Weinberg, 1995; 
Peskin and Schrocder, 1995; Haag, 1996). There arc mul- 
tiple decay channels, extra soft photons may always ap- 
pear, so that the so-called 'one-photon' states are often 
accompanied by soft multiphoton components, 

a|0)+/3|U+7|2^,^,,) + ..., |/3|~1. (63) 

Thus the physical realization of a single qubit is itself 
necessarily an idealization. 

C. Entanglement in quantum field theory 

Recall that while the Reeh-Schlieder theorem ensures 
that any state can be approximated by local operations, 
the clustering property of the vacuum implies that lo- 
cally created states look almost like a vacuum for distant 
measurements. The Reeh-Schlieder and Epstcin-Glaser- 
Jaffe theorems entail dark counts for local detectors. The 
responses of spatially separated detectors are correlated, 
but these correlations decay fast due to cluster proper- 
ties. 



We now consider correlation experiments with devices 
a and b placed in spacclike-scparatcd regions Ol and Or, 
so all local operators pertaining to these regions com- 
mute: [A{0 l) , A{0 r)] = 0. In each region, there are 
two such devices, labelled ai,a2, 61,62, which yield out- 
comes "yes" or "no" in each individual experiment. We 
denote the probabilities for positive outcomes as p{aj) 
and p{bk), and by p{aj A bk) the probability of their joint 
occurrence. 

The measuring apparatus aj is described by a POVM 
element Fj e A{Ol) and the probability of the "yes" 
outcome for a state p is tr (pFj). If Gk is the POVM 
for apparatus bk then the probability of the "yes-yes" 
outcome is tr {p FjGk)- Let us to introduce operators 
Aj = 2Fj - 1 and Bk = 2Gk - 1, and define 

C(a, 6, p) = itr {p[A,{B, + B2) + ^2(^1 - B2)]}. (64) 

This quantity, which is experimentally measurable, has a 
classical analogue whose value is bounded: C < I. This 
is the CHSH inequality (Clauser et al, 1969), which is 
one of the variants of the Bell inequality (Bell, 1964)."^^ 
The above definition of C can be extended to 

C{A,B,p)=supC{a,b,p), (65) 

where A = A{Ol), B = A{Ori), and the supremum 
is taken over all operators Aj,Bk. It was shown by 
Cirel'son (1980) that there is also a quantum bound on 
correlations: for commuting algebras A and B and any 
state p, 

aA,B,p)<V2. (66) 

Further results of Siunnicrs and Werner (1985, 
1987a, b) and Landau (1987) establish that a violation 
Bell's inequalities is generic in quantum field theory. For 
any two spacelike separated regions and any pairs of op- 
erators, a, 6, there is a state p such that the CHSH in- 
equality is violated, namely, ({a,b,p) > 1. With addi- 
tional technical assumptions the existence of a maximally 
violating state pm can be proved: 

C(a, 6, p„0 = V2, (67) 

for any spacelike separated regions Ol and Or. It follows 
from convexity arguments that states that maximally vi- 
olate Bell inequalities are pure. What are then the op- 
erators that lead to maximal violation? Summers and 
Werner (1987a) have shown that operators Aj and Bk 
that give ( = V2 satisfy A'j = 1 and A1A2 + A2A1 = 0, 
and likewise for Bk- If we define A3 := —i[Ai,A2]/2, then 
these three operators have the same algebra as Pauli spin 
matrices (Summers, 1990). Even if we ignore the problem 



Recall that the Bell inequalities arc essentially classical (Peres, 
1993). Their violation by a quatum system is a sufBcient condi- 
tion for entanglement, but not a necessary one. 
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of localization (Sec. V.B), a violation of Bell inequalities 
is not at all trivial, as the analysis of various relativis- 
tic spin operators shows (Terno, 2003). For example, for 
moving observers, if the observables are constructed by 
means of the Pauli-Lubanski operator, the amount of vio- 
lation of Bell's inequality decreases with increasing veloc- 
ity, and the inequality is satisfied in the ultra-relativistic 
limit (Czachor, 1997; Ahn et ai, 2003). 

The violation of Bell's inequalities by the vacuum state 
does not mean that it is enough to have two detectors and 
check their dark count coincidences. The cluster theorem 
predicts a strong damping of the violations with distance. 
When the lowest relevant mass is m > 0, clustering leads 
to the estimate 

aA{OL),A{OR),n) <l + Ae^p[~mr{OL,OR)], (68) 

where r(C'L,Cij) is the separation between the regions 
(Summers and Werner, 1985, 1987a, b). For massless par- 
ticles, the energy threshold for photodetection serves as 
an effective mass. Therefore, a direct observation of vac- 
uum entanglement should be extremely difficult. Reznik 
(2000) proposed a method to convert vacuum entangle- 
ment into conventional bipartite entanglement. It re- 
quires to switch on and off in a controllable way the 
interaction between two-level systems and a field. Ap- 
propriately tailored local interaction Hamiltonians can 
then transfer vacuum entanglement to atoms. 

The classification of entangled states and their manip- 
ulation are current research topics in quantum informa- 
tion theory. Up to now we have dealt with entangle- 
ment of a finite number of degrees of freedom, or spin- 
momentum entanglement. After introducing Lorentz 
transformations, we were still able to use the standard 
techniques of the non-relativistic theory. However, in the 
general case, infinite-dimensional Hilbert spaces are in- 
volved. Recently Parker, Bose, and Plenio (2000), Eisert, 
Simon, and Plenio (2002), and Keyl, Schlingemann, and 
Werner (2003) investigated the entanglements of forma- 
tion and of distillation in infinite-dimensional systems. 

When the Hilbert space of a bipartite system is infinite 
dimensional, some peculiarities arise. For pure states, 
a natural measure of entanglement is the von Neumann 
entropy S = — tr p In p of either one of the reduced density 
matrices. It can be shown (Eisert, Simon and Plenio, 
2002) that in an arbitrarily small neighborhood of any 
state there is an infinity of entangled states. The reason is 
that in the neighborhood of any state with finite energy, 
there are states of infinite entropy (Wehrl, 1978)."'^^ This 
seems paradoxical, but if we consider states with bounded 
energy only, the continuity of the degree of entanglement 
is restored. 

Keyl, Schlingemann and Werner (2003) applied tech- 
niques of operator algebra to systems with an infinite 



The set of states with infinite entropy is trace-norm dense in the 
state space. 



number of degrees of freedom. A usuful device in the de- 
scription of infinite sytems is the notion of singular states, 
which cannot be represented by density operators: states 
are considered to be just positive linear functionals on 
the space of POVMs, and only non-singular states are 
represented by density operators (Emch, 1972; Bratteli 
and Robinson, 1987). One of their results is a rigorous 
description to the original EPR (1935) state, which can 
be modeled as a sequence of more and more squeezed 
two-mode states, and actually is a singular state. 

Pachos and Solano (2003) discussed the generation of 
entangled states and performed ab initio QED calcula- 
tions for the case of two interacting spin-i charged par- 
ticles. They obtained particular results for low energy 
scattering, and more general situations are under inves- 
tigation. 

D. Accelerated detectors 

In quantum field theory, the vacuum is defined as the 
lowest energy state of a field. A free field with linear 
equations of motion can be resolved into normal modes, 
such as standing waves. Each mode has a fixed frequency 
and behaves as a harmonic oscillator. The zero point mo- 
tion of all these harmonic oscillators is called "vacuum 
fluctuations" and the latter, under suitable conditions, 
may excite a localized detector that follows a trajectory 
x'^{t) parametrized by its proper time t. The internal 
structure of the detector is described by non-relativistic 
quantum mechanics, so that we can indeed assume that it 
is approximately localized, and it has discrete energy lev- 
els En- Furthermore, we assume the existence of a linear 
coupling of an internal degree of freedom, /i, of the de- 
tector, with the scalar field (j){x{T)) at the position of the 
detector. First-order perturbation theory gives the fol- 
lowing expression for the transition probability per unit 
proper time: 

g^Y.\{EMEo)\' f dre-'(^-^'')-M/(r), (69) 
where g is a coupling constant and 

W{t) = W{x{t{),x{t2)), T = Tl-T2, (70) 

is the Wightman function, defined by W{xi,X2) = 
{^l\^{xi)^l){x2)\^) for two arbitrary points on the detec- 
tor's trajectory (Streater and Wightman, 1964). The in- 
tegral in Eq. (|69|l is the Fourier transform of the auto- 
correlation. In other words, it gives the power spectrum 
of the Wightman function. 

For inertial detectors (that is, = v'^t with a con- 
stant four- velocity w") the transition probability is zero, 
as one should expect. However, the response rate does 
not vanish for more complicated trajectories. Consider 
in particular one with constant proper acceleration a. 
With an appropriate choice of initial conditions, it corre- 
sponds to the hyperbola ^x^ = 1/a^, shown in Fig. 3. 
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Then the transition rate between levels appears to be 
the same as for an inertial detector in equilibrium with 
thermal radiation at temperature T = ha/2'!Tck^. This 
phenomenon is called the Unruh (1976) effect. It was 
also discussed by Davics (1975) and it is related to the 
fluctuation-dissipation theorem (Candelas and Sciama, 
1977) and to the Hawking effect that will be dicussed in 
the next section. A rigorous proof of the Unruh ef- 
fect in Minkowski spacetime was given by Bisognano and 
Wichmann (1976) in the context of axiomatic quantum 
field theory, thus establishing that the Unruh effect is not 
limited to free field theory. 
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FIG. 3 Dependence of the spin entropy S, in Bob's frame, 
on the values of the angle 6 and a parameter F ~ [1 — (1 — 
/3'^)^''^] A/m/3, where A is the momentum spread in Alice's 
frame. 

For any reasonable acceleration, the Unruh tempera- 
ture is incomparably smaller that the black-body temper- 
ature of the cosmic background, or any temperature ever 
attained in a laboratory, and is not observable. Levin, 
Peleg, and Peres (1992) considered the effect of shield- 
ing a hypothetical experiment from any parasitic sources. 
This, however, creates a radically new situation, because 
the presence of a boundary affects the dynamical prop- 
erties of the quantum field by altering the frequencies of 
its normal modes. Finite-size effects on fields have been 
known for a long time, both theoretically (Casimir, 1948) 
and experimentally (Spaarnay, 1958). Levin, Peleg, and 
Peres showed that if the detector is accelerated together 
with the cavity that shields it, it will not be excited by 
the vacuum fluctuations of the field. On the other hand, 
an inertial detector freely falling within such an acceler- 



ated cavity will be excited. The relevant property in all 
these cases is the relative acceleration of the detector and 
the field normal modes. 

We now consider the evolution of an arbitrary quan- 
tum system. An observer at rest (Alice) can describe the 
quantum evolution on consecutive parallel slices of space- 
time, t = const. What can Bob, the accelerated observer, 
do? From Fig. 3, one sees that there is no communica- 
tion whatsoever between him and the region of spacetime 
that lies beyond both horizons. Where Alice sees a pure 
state. Bob has only a mixed state. Some information is 
lost. We shall return to this subject in the next, final 
section. 



VI. BEYOND SPECIAL RELATIVITY 

It took Einstein more than ten years of intensive work 
to progress from special relativity to general relativity. 
Despite its name, the latter is not a generalization of 
the special theory, but a radically different construct: 
spacetime is not only a passive arena where dynamical 
processes take place, but has itself a dynamical nature. 
At this time, there is no satisfactory quantum theory of 
gravitation (after seventy years of efforts by leading the- 
oretical physicists). 

In the present review on quantum information theory, 
we shall not attempt to use the full machinery of general 
relativity, with Einstein's equations. We still consider 
spacetime as a passive arena, endowed with a Rieman- 
nian metric, instead of the Minkowski metric of special 
relativity. The difference between them is essential: it 
is necessary to introduce notions of topology, because it 
may be impossible to find a single coordinate system that 
covers all of spacetime. To achieve that result, it may be 
necessary to use several coordinate patches, sewed to each 
other at their boundaries. Then in each patch, the metric 
is not geodesically complete: a geodesic line stops after a 
finite length, although there is no singularity there. The 
presence of singularities (points of infinite curvature) is 
another consequence of Einstein's equations. It is likely 
that these equations, which were derived and tested for 
the case of moderate curvature, are no longer valid under 
such extreme conditions. We shall not speculate on this 
issue, and we shall restrict our attention to the behav- 
ior of quantum systems in the presence of horizons., in 
particular of black holes. Before we examine the latter, 
let us first return to entanglement, now in curved space- 
time, and to the Unruh effect, still in flat spacetime, but 
described now in an accelerated coordinate system. 



Properties of detectors undergoing circular acceleration, as in 
high energy accelerators, were investigated by Bell and Leinaas 
(1983), Levin, Peleg, and Peres (1993), and by Davies, Dray, and 
Manogue (1996). 



Concepts of quantum information were recently invoked in sev- 
eral problems of quantum gravity and quantum cosmology, but 
we restrict ourselves to conventional black hole physics. 
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A. Entanglement revisited 

Calculations on EPRB correlations require a common 
reference frame. Only then can statements such as "if 
mi2=i, then m2z—^^" have an operational meaning. In 
a curved space we can choose an arbitrary frame at one 
spacetimc point and then translate it parallel to itself 
along a geodesic path. For example, spin-i particles may 
be sent to Alice and Bob, far away. After a reference 
frame is chosen at the emission point, local frames are 
established for them by parallel transport along the parti- 
cles' trajectories. However, particles only approximately 
follow classical geodesic trajectories, and this inevitably 
introduces uncertainties in the definition of directions. 
Using path integral methods, von Borzeszkowski and 
Mensky (2000) have shown that if certain conditions are 
met, approximate EPR correlations still exist, but "the 
longer the propagation and the stronger the gravitational 
field, the poorer is the correlation". 

One of the difficulties of quantum field theory in curved 
spacetimes is the absence of a unique (or preferred) 
Hilbert space, the reason being that different represen- 
tations of canonical commutation or anticonnnutation 
relations lead to unitarily inequivalent representations 
(Emch, 1972; Bogolubov et al, 1990; Haag 1996). For 
the Minkowski spacetimc, the existence of a preferred 
vacuum state enables us to define a unique Hilbert space 
representation. A similar construction is also possible in 
stationary curved spacetimes (Fulling, 1989; Wald, 1994). 
However, in a general globally hyperbolic spacetimc this 
is impossible, and one is faced with multiple inequivalent 
representations. 

Genuinely different Hilbert spaces with different den- 
sity operators and POVMs apparently lead to predictions 
that depend on the specific choice of the method of cal- 
culation. The algebraic approach to field theory can re- 
solve this difficulty for PVMs. The essential ingredient 
is the notion of physical equivalence (Emch, 1972; Araki, 
1999; Wald, 1994), which allows to extend the formalism 
of POVMs and CP maps to general globally hyperbolic 
spacetimes (Terno, 2002). 

The simplest example of inequivalent representations 
occurs in the discussion of the Unruh effect, when we 
wish to use quantum field theory in the Rindler wedge 
X > \t\ where the detector moves, or in the opposite 
wedge X < — which is causally separated from it, or 
in both wedges together. Each one of the two wedges, or 
both together, can be considered as spacetimes on their 
own right (Rindler spaces), where a global timelike field 
is obtained from the set of all hyperbolas with diff'erent 
values of the acceleration (Wald, 1984). 

The transformation between Minkowski and Rindler 
wedge descriptions are unitary only formally (Unruh and 
Wald 1984; Wald 1994) and algebraic field theory should 
be used to give a rigorous interpretation to these for- 
mal expressions (Emch, 1972; Haag, 1996). A quantum 
field theory can be defined in a standard way because 
the Rindler spaces are globally hyperbolic. They admit 



a Cauchy surface for specifying initial values, whose do- 
main of development is the entire spacetimc (Hawking 
and Ellis, 1973; Wald, 1984 and 1994). The vacuum state 
\0r) obtained in this construction is called a Rindler vac- 
uum. It is a natural vacuum for observers who move on 
orbits like in Fig. 3, with difii'erent positive values of the 
acceleration a. 

As a consequence of the Reeh-Schlieder theorem, it 
follows that a Minkowski vacuum \Q) corresponds to a 
mixed state in the Rindler spacetimc. To relate the 
Minkowski and Rindler Hilbert spaces, fields in both 
wedges arc required. The relation between the standard 
Minkowski Fock space and a tensor product of Rindler 
Fock spaces is given by a formally unitary operator U, 
whose action on the Minkowski vacuum is 

oo 

U\Q) = JJ 51 exp(-n7ra;i/a)|nji,) O |njij), (71) 

i n—0 

where w, denotes the frequencies of the modes of the 
Rindler fields, and rii are the corresponding occupa- 
tion numbers. The above expression suggests that the 
Minkowski vacuum has the structure of a maximally 
entangled state when viewed by accelerated observers. 
When restricted to only one wedge, the state becomes 

oo 

P = '[\.^exp{-mnjJi/a)Z~^\niR}{niR\, (72) 

i n=0 

where the ith mode was normalized by Zi = 
exp(— riTTWi/a). That state indeed produces a ther- 
mal density matrix p oc exp(— iJ/j/T), where Hr is the 
field Hamiltonian for region R, and T = ah/2'!Tck^. We 
can now calculate the entanglement of the Minkowski 
vacuum as seen by an accelerated observer. A natural 
reduced density matrix is p itself, which is a singular 
state (in the sense of Sec. V.C) of an infinite thermal 
bath. Its entropy is infinite, which is in agrcmcnt with 
the previous discussion, since the energy of such a system 
is also infinite. 

The relationship between Minkowski and Rindler wave 
packets was analyzed by Audretsch and Miiller (1994a). 
These authors also discussed local detection by Rindler 
observers and EPR-like correlations (Audretsch and 
Miiller, 1994a, b). 

Alsing and Milburn (2002, 2003) examined the fidelity 
of teleportation from Alice in an inertial frame to Bob 
who is uniformly accelerated. Assume that qubits are 
realized by some mode lj of the electromagnetic field, 
and that Alice's state is |*) = a|n) -|- where |n) 
is the Minkowski vacuum. Then the best state that Bob 
can hope to get is 

I*') =a|0ii)+/3|lii), (73) 

where \Qr) is the Rindler vacuum, and some mode lo' (as 
seen by Bob) was chosen for his realization of qubits. The 
fidelity of teleportation |^) — > ^i/') then decreases with 
Bob's acceleration. It also depends on time: the fidelity 
of course vanishes when Alice is behind Bob's horizon. 
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B. The thermodynamics of black holes 

Black holes result from concentrations of matter so 
large that their gravity pull prevents the escape of light 
(Michell, 1784; Laplace, 1795). In other words, a fu- 
ture horizon is formed. While Unruh's horizons were for 
observers whose asymptotic speed approaches c, a black 
hole horizon affects every observer. We now present ba- 
sic facts of black hole physics, limiting ourselves almost 
exclusively to spherically-symmetric spacetimes. The lit- 
erature on black holes is voluminous, and our sketch gives 
just a glimpse of this fascinating subject. Our main 
sources for classical black hole physics were Landau and 
Lifshitz (1975), Hawking and ElUs (1973), Wald (1984) 
and Frolov and Novikov (1998). For quantum aspects, we 
consulted Birrell and Davies (1982), Wald (1994), Brout 
et al. (1995), and Frolov and Novikov (1998). An ex- 
tensive survey of black hole thermodynamics was given 
by Wald (1999, 2001). In this section, unless otherwise 
stated, c — G — h — 1. 

Spacetime outside a spherically symmetric distribu- 
tion of matter (and hence outside an incipient black 
hole during all stages of its collapse) is described by the 
Schwarzschild metric, 

ds^ = (1 _ 2M/r)df - (1 - 2M/r)-^dr^ - r'^dn'^. (74) 

The proper time of a stationary observer is dr = 
^/Ottdt = ^1 — 2M/rdt, and the radial distance is dl = 
\/—grrdr = (1 — 2M/r)~^/'^dr. This metric has a coordi- 
nate singularity at r = 2M, which can be removed by a 
transition to various alternative coordinate systems. As 
we shall sec, it is a kind of "boundary" of the black hole. 
On the other hand, the singularity at r = is physical: 
the spacetime curvature diverges there. 

Spacetimes may have symmetries. If translation along 
a family of curves leaves the metric invariant, the field of 
tangent vectors to these curves is called a Killing field 
(Killing, 1892). Killing vectors x** have many useful 
properties. 

For example, the Schwarzschild metric is invariant un- 
der time translations, t t + t. The corresponding 
Killing vector x'^ = (Ij 0, 0, 0) is timelike for r > 2M and 
spacelikc for r < 2M. It becomes null on the horizon. 
The surface gravity k, which characterizes the strength 
of gravitational field near the horizon, is defined as 



K = lim(aa). 



(75) 



where a is the norm of the proper four-acceleration of 
a stationary object, and a is a red-shift factor. For 
Schwarzschild black holes, a = ^fgtt and k — l/AM. 
Hawking and Ellis (1973), and Wald (1984, 1999) de- 
scribe many properties of the surface gravity. Bardeen, 
Carter, and Hawking (1973), have shown that k is con- 
stant over the horizon of any stationary black hole. This 
is known as a the zeroth law of black hole mechanics. 

Even in classical general relativity, there is a serious 
difficulty with the second law of thermodynamics when 



a black hole is present: if we drop ordinary matter into a 
black hole, it will disappear into a spacetime singularity, 
together with its entropy S. No compensating gain of 
entropy occurs, so that the total entropy in the universe 
decreases. One could attempt to salvage the second law 
by invoking the bookkeeping rule that one must continue 
to count the entropy of matter dropped into a black hole 
as still contributing to the total entropy of the universe. 
However, the second law would then be observationally 
unverifiable. 

It was noted by Bekenstein (1972, 1974) that proper- 
ties of the horizon area of a stationary black hole resemble 
those of entropy. In the most general case, a stationary 
black hole is characterized by three parameters: its mass 
M, angular momentum J and charge Q. The first law 
of black hole dynamics (Bardeen, Carter, and Hawking, 
1973; Iyer and Wald, 1994) states that 



dM 



-dA + fldJ + ^dQ, 



(76) 



where is the angular velocity and $ the electric poten- 
tial. This relation is formally identical to the first law of 

thermodynamics, if we identify temperature with surface 
gravity and entropy with horizon area. We would then 
have 



T = 



he" 



2n GK 



S = 



ill' 



(77) 



where 1^ = y/HG/c^ is the Planck length, and ordinary 
units were restored. 

Bekenstein (1972, 1974) proposed to assign to a black 
hole of area A an entropy 



S^^ = Ac^/AhG, 



(78) 



thus elevating a formal analogy to the status of a physical 
law. Hawking (1974) found that a black hole radiates like 
a black body at temperature T, and thereby put the anal- 
ogy between black hole mechanics and thermodynamics 
on firm ground. 

There are many ways to explain Hawking radiation 
(Hawking, 1975; Wald, 1975; Birrell and Davies, 1982; 
Fredenhagen and Haag, 1990; Wald, 1994; Brout et al, 
1995) . Here, we follow the informal presentation of Frolov 
and Novikov (1998), which is based on the analogy with 
pair creation by an external static field. Actually, space- 
time is not static when a star collapses into a black hole 
and later evaporates. However, usually it is an excellent 
approximation to treat it as static. A rigorous analysis 
along these lines was made by Brout et al. (1995). 

Similarities between pair production and Hawking ra- 
diation were discussed by Miiller, Greiner and Rafeski 
(1977). Let r be the field strength and g the charge. 
By analogy with the tunnel effect, the probability that a 
virtual pair of particles be found at a distance I from 
one another is approximately e~^l^, where A is the 
Compton wavelength. A pair may turn to be real if 
gVl > 2mc^ . Thus, the probability of particle creation is 
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w (X exp{—<^m^c^/hgT), where the numerical constant 
can be obtained by a more detailed calculation. 

A naive application of this formula to particle creation 
in a static gravitational field turns out to give not only 
the right result, but also some valuable insights. In par- 
ticular, conservation of energy implies that a static gravi- 
tational field can create particles only if there are regions 
with timelike Killing fields and others with spacelike ones; 
a horizon is needed. A static gravitational field with- 
out horizons cannot create particles (Birrell and Davies, 
1982; Wald, 1984). A black hole emits particles as if it 
were a black body with temperature 

T = K/27rfc3, (79) 

as in Eq. (|77jl . 

The generalized second law of thermodynamics 
(Bekenstein, 1974; Frolov and Page, 1993; Wald, 1994; 
Frolov and Novikov, 1998) states that 

AS + A5bh > 0. (80) 

An informational analysis of this law by Hosoya, Carlini, 
and Shimomura (2001) clarified its relation to classical 
bounds on accessible information (Levitin, 1969, 1987; 
Holevo, 1973). Bekenstein and Mayo (2001) and Beken- 
stein (2002) gave a description of the information absorp- 
tion and emission by black holes in terms of quantum 
channels. 

A natural question is what (and where) are the de- 
grees of freedom responsible for the black hole entropy. 
On this issue, there are conflicting views. It is not clear 
whether we should think of these degrees of freedom as 
residing outside the black hole in its thermal atmosphere, 
or on the horizon in Chern-Simons states, or inside the 
black hole, associated with what classically corresponds 
to the singularity deep within it. Or perhaps the micro- 
scopic origin of S'^^ is the entanglement between Hawk- 
ing particles inside and outside the horizon (Bombelli 
et ai, 1986; Ashtekar et ai, 1994; lorio, Lambiase, and 
Vitiello, 2001). It is likely that in order to gain a bet- 
ter understanding of the degrees of freedom responsible 
for black hole entropy, it will be necessary to achieve 
a deeper understanding of the notion of entropy itself 
(Zurek, 1990). 

Suppose now that the matter that has fallen inside the 
horizon had quantum correlations with matter that re- 
mained outside. How is such a state described by quan- 
tum theory? Are these correlations observable? This 
problem is not yet fully understood, although such cor- 
relations play an essential role in giving to Hawking ra- 
diation a nearly exact thermal character (Wald, 1975). 
It is hard to imagine a mechanism for restoring the cor- 
relations during the process of black hole evaporation. 
On the other hand, if the correlations between the inside 
and the outside of a black hole are not restored during 
the evaporation process, then by the time that the black 
hole has evaporated completely, an initial pure state will 
have evolved to a mixed state, and some "information" 
will have been lost. 



Hawking's radiation resolved the thermodynamic diffi- 
culty only to introduce another puzzle. An inevitable re- 
sult of that radiation is the evaporation of the black hole 
after a finite time (see Appendix B). Since the emitted 
particles are overwhelmingly massless, black hole evapo- 
ration leads to baryon number non-conservation. 

Hawking (1976, 1982) also introduced a superopera- 
tor to describe the quantum state evolution during the 
black hole formation and evaporation (see Appendix B). 
A detailed analysis of this superoperator was made by 
Strominger (1996). It is (at least formally) completely 
positive and as such it is a perfectly normal operation of 
quantum information theory (Terno, 2002). 

Yet, it has often been asserted that the evolution of an 
initial pure state into a final mixed state confiicts with 
quantum mechanics, and this issue is usually referred to 
as the "black hole information loss paradox." These pes- 
simistic views are groundless. When black hole thermo- 
dynamics appeared in the 70's, notions such as POVMs 
and completely positive maps were unknown to the rel- 
ativistic community. Today, we know that the evolution 
of pure states into mixtures is the general rule when a 
classical intervention is imposed on a quantum system, 
as we have seen in Sec. II. In the present case, the clas- 
sical agent is the spacetime metric itself, which is bor- 
rowed from classical general relativity in the absence of 
a consistent quantum gravity theory. Attempts to in- 
troduce a hybrid quantum-classical dynamics by using 
the Koopman (1931) formalism are not mathematically 
inconsistent, but they violate the correspondence prin- 
ciple and are physically unacceptable (Peres and Terno, 
2001). Anyway, the evolution of an initial pure state into 
a final mixed state is naturally accomodated within the 
framework of the algebraic approach to quantum theory 
(Wald, 1994), and that of a generalized quantum theory 
(Hartle, 1998). 

The final fate of black holes and its relation to the in- 
formation paradox were discussed by Preskill (1993), 't 
Hooft (1996, 1999) and Frolov and Novikov (1998). How- 
ever, this issue may be conclusively resolved only after 
there is a consistent theory of quantum gravity, allow- 
ing meanwhile for a number of tantalizing speculations. 
Here we present five of the most popular alternatives of 
what happens with the "information" when a black hole 
evaporates. 

• Information is lost: Hawking's superscattering that 
was described above is a fundamental feature of 
quantum theory and not just an effective descrip- 
tion. 

• There is no information loss: if the spectrum is an- 
alyzed carefully, there may be enough non-thermal 
features to encode all the information. Bekenstein 
(1993) showed that deviations of the Hawking ra- 
diation from the black body spectrum may help 
reconstruct part of the information. Hod (2002) 
estimated that, under suitable assumptions about 
black hole quantization, the maximal information 
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emission rate may be sufficient to recover all the in- 
formation from the resulting discrete spectrum of 
the radiation. 

• Information comes out at the end, at the Plank 
scale physics. Frolov and Vilkovisky (1981) con- 
structed a model that provides for this possibility. 

• There is a stable black hole remnant with about the 
Planck mass (0.02 ng) and information is somehow 
encoded in it (Aharonov, Casher, and Nussinov, 
1987). 

• Information escapes to baby universes, that are cre- 
ated instead of true singularities (Zel'dovich, 1977; 
Hawking, 1988). The overall evolution of the entire 
multiverse is unitary, but since baby universes are 
causally unconnected to our universe and the total 
state is entangled, we perceive a loss of information. 

Still a different scenario is implied by the works of Ger- 
lach (1976) and Boulware (1976): a particle that falls 
into an eternal black hole crosses the horizon after an in- 
finite amount of the coordinate time t, but only a finite 
amount of its own proper time. On the other hand, the 
evaporation of a black hole takes a finite amount of the 
coordinate time, which is the physical time of a distant 
observer (see Appendix B). From the point of view of 
the infalling observer, the horizon always appears to re- 
cede before her, until it finally disappears (or shrinks to 
the Planck scale) and the region "beyond the horizon" is 
unattainable. The distant observer sees the infalling one 
quickly arrive arbitrarily close to the effective horizon, 
then she is nearly "frozen" there for an exceedingly long 
time, and finally either the black hole evaporates or the 
universe collapses. Therefore it makes no sense to assert 
that states having (essential) support on the part of the 
Cauchy surface that lies beyond the horizon would be 
correlated with an outgoing Hawking radiation and then 
mysteriously disappear. There is no issue of information 
loss at all (Sonego, Almergren, and Abramowitz, 2000; 
Alberghi et al., 2001). 

C. Open problems 

The good news are that there is still plenty of work 
to be done. Here we shall mention a few problems that 
appear interesting and from which more physics can be 
learnt. 

• As mentioned in Sec. lV.Bl quantum field theory im- 
plies a trade-off between the reliability of detectors 
and their localization. This is an important prac- 
tical problem. A proper balance must be found 



We have listed this opinion last, because it is the one we tend to 
support. 



between the loss of undetected signals, false alarms 
(dark counts), and our knowledge of the location of 
recorded events. A quantitative discussion of this 
problem would be most welcome. 

• It is possible to indicate the approximate orienta- 
tion of a Cartesian frame by means of a few suitably 
prepared spins (Bagan, Baig, and Muiioz-Tapia, 
2001), or even a single hydrogen atom (Peres and 
Scudo, 2001). Likewise, the quantum transmission 
of the orientation of a Lorentz frame should be 
possible. This problem is much more difficult, be- 
cause the Lorentz group is not compact and has no 
finite-dimensional unitary representations (Wigner, 
1939). 

• Progressing from special to general relativity, what 
is the meaning of parallel transport of a spin? In a 
curved spacetime, the result is obviously path de- 
pendent. Then what does it mean to say that a 
pair of distant particles is in a singlet state? As 
the rotation group 0(3) is not a valid symmetry, 
the classification of particles, even the usefulness of 
the concept of a particle, become doubtful. Meth- 
ods are known for quantization of higher spin fields 
in a curved background (Birrell and Davies, 1982; 
Wald, 1994), but what is the operational meaning 
of the resulting states and POVMs? 

• We still need a method for detection of relativistic 
entanglement that involves the spacetime proper- 
ties of the quantum system, such as a combination 
of localization and spin POVMs (in flat or curved 
metric backgrounds). 

• After all these problems have been solved, we'll still 
have to find a theory of the quantum dynamics for 
the spacetime structure. 
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APPENDIX A: Relativistic states transformations 

In this Appendix we list the conventions we used and 
outline the transformation rules for free particle states. 
Details can be found in the treatises of Bogolubov, Lo- 
gunov and Todorov (1975), and Weinberg (1995). Ex- 
plicit forms of the transformation laws for massive parti- 
cles are given by Halpern (1968), Bogolubov et a/.(1975). 
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and Ahn et al. (2003); for massless particles, by Lind- 
ner, Peres, and Terno (2003) and Bcrgou, Gingrich, and 
Adami (2003). 

Unless stated otherwise, we chose the following con- 
ventions for states and related operators: 



(Al) 



and 



{aMLq) = (27r)3(2/)5.55(3)(p „ (^2) 



where p° = E{p) ~ \J ni^ + . One-particle states are 

i/ja{p)\a,p)dn{p), (A3) 



E 



(A4) 



with the Lorentz-invariant measure 

dfi{p) = d^p/{2nf{2p°). 
The wave functions j^*) satisfy 

{a,p\^)^Mp), (A5) 

and 



'4'*a{p)4>a{p)dn{p). 



(A6) 



If we want to be more explicit about the spin degrees 
of freedom, we use 2-spinor notations: a pure state of 
definite momentum and arbitrary spin is (^) \p) . The one- 
to-one correspondence with Dirac's notation is explained 
by Bogolubov, Logunov and Todorov (1975). 

Under a classical, geometric Lorentz transformation 
yi^i- — K^^^x^ ^ the unitary transformation of the basis vec- 
tors (A.l) is 

U{K)\cj,p)=Y,DaW{K,pm,kp), (A7) 

where D^a- are matrix elements of the unitary operator D 
that corresponds to the Wigner rotation M^(A,p), given 
by Eq. (|X8|) below. 

Note that the spin rotation depends on the value of the 
momentum (spin is a secondary variable, as defined in 
Sec. IV). The quantum circuit in Fig. 4 gives a graphical 
representation of primary vs. secondary variables. 

The Wigner rotation matrix is given by 



W{K,p) -.^ L-\Kp)KL{p), 



(AS) 



where Lijj) is a "standard boost" which transforms a 
"standard four-momentum" ks into p. For massive par- 
ticles ks = (m, 0,0,0), while for massless ones it is 



ks = (1,0,0,1). Explicit formulas for L{p) in the mas- 
sive and massless cases are given in the books of Halpern 
(1968), Bogolubov et al. (1990), and Weinberg (1995). 

Wave functions having a distribution of momenta 
transform as 

^^(9) = {Lq\U{A)Y^ fd^l{p)Mp)W,P), 

a '' 

= J2 ldii{p)Mp)Dx.[W{K,pm,q\xAp). 
= Y,Dia[WiA,A-'p)]MA-'p), (A9) 

SO the same state in the boosted frame is 
I*') =E r D„dW{A,A-'p)]^^{A-'p)\a,p)dfi{p). 

(AlO) 

Explicit expressions for D \W] are given in Section IV and 
in the references cited above. 
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FIG. 4 Relativistic state transformation as a quantum cir- 
cuit: the gate D which represents the matrix D^ct[VF(A,p)] 
is controlled by both the classical information and the mo- 
mentum p, which is itself subject to the classical information 
A. 



APPENDIX B: Black hole radiation 

The energy radiated by a black hole satisfies approxi- 
mately the Stefan-Boltzmann law, (Frolov and Novikov, 
1998; Brout et al., 1995) so the rate of mass loss due to 
energy conservation is 



(Bl) 



where A is the horizon area and time is that of a dis- 
tant observer. It can be shown that a relation T (x M^^ 
holds in quasi-static changes of mass at all stages of evap- 
oration. Numerical coefficients were calculated by Page 
(1976). A back hole of initial mass Mq (not too small) 
evaporates after a time 



tE = aM^, 



(B2) 



where a = 4.9 x 10 ^sec/kg^. Together with HBlfl . this 
gives the following expression for the mass 



This representation was suggested to us by Barbara Terhal. 



M(t) =Mo(l-tA£)i/^ 



(B3) 
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The duration of the steady-state radiation build-up is in- 
comparably shorter than (Wald, 1994; Brout et ai, 
1995), so that the above expression is a good approxi- 
mation. Hence it takes a time comparable to the age of 
the universe for a black hole of mass 5 x lO^'^g (and ra- 
dius of atomic size) to evaporate completely (Frolov and 
Novikov, 1998). 

Hawking (1976, 1982) introduced a superoperator 
(originally called "superscattering operator"), that was 
mentioned in Sec. IVl.BI to describe the quantum state 
evolution during the black hole formation and evapora- 
tion. In standard scattering theory, a unitary S'-matrix 
relates the density matrix of final states with to of the 
incoming states: — Sp"^S^ . For a spacetime with an 
evaporating black hole, S would map states from Tiin (the 
states in the distant past, when the black hole did not 
exist yet) to the tensor product of Tiout (the states that 
reach infinity and are accessible to a distant observer) 
and the Hilbert space of states that fell into the black 
hole. This splitting is a standard step in many deriva- 
tions of the Hawking radiation (Wald, 1994). Since only 
the states that reach infinity are accessible to a distant 
observer, the final density matrix is calculated by tracing 
out the black hole, 

p™* ^ tr Sp'^'Sl (B4) 
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